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Date: 23 Sef^tf Express Mail Label No. £L L^lb 7.7 7?> t)<Z 



Inventors: Andreas Stahl, David J. Hirsch, Harvey F. Lodish, Ruth E. Gimeno and 

Louis A. Tartaglia 
Attorney's Docket No.: WHI97-21p3MC2 

FATTY ACID TRANSPORT PROTEINS 

RELATED APPLICATIONS 

This application is a continuation-in-part of U.S. Patent Application Number 
09/232,201 filed January 14, 1999, which claims the benefit of U.S. Provisional 
5 Application No. 60/1 10,941 filed December 4, 1998; U.S. Provisional Application No. 
60/093,491 filed July 20, 1998; and U.S. Provisional Application No. 60/071,374 filed 
January 15, 1998. The teachings of each of these referenced applications are 
incorporated herein by reference in their entirety. 

GOVERNMENT SUPPORT 
0 The invention was supported, in whole or in part, by a grant from the National 

Heart, Lung, and Blood Institute (HL41484), by National Institutes of Health Grant DK 
47618 and National Institutes of Health Grant 5 T32 CA 09541. The United States 
Government has certain rights in the invention. 

BACKGROUND OF THE INVENTION 
5 Long chain fatty acids (LCFAs) are an important source of energy for most 

organisms. They also function as blood hormones, regulating key metabolic functions 
such as hepatic glucose production. Although LCFAs can diffuse through the 
hydrophobic core of the plasma membrane into cells, this nonspecific transport cannot 
account for the high affinity and specific transport of LCFAs exhibited by cells such as 
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cardiac muscle, hepatocytes, enterocytes, and adipocytes. The molecular mechanisms 
of LCFA transport remains largely unknown. Identifying these mechanisms can lead to 
pharmaceuticals that modulate fatty acid uptake by the intestine and by other organs, 
thereby alleviating certain medical conditions (e.g. obesity). 

5 SUMMARY OF THE INVENTION 

Described herein is a diverse family of fatty acid transport proteins (FATPs) 
which are evolutionarily conserved; these FATPs are plasma membrane proteins which 
mediate transport of LCFAs across the membranes and into cells. Members of the 
FATP family described herein are present in a wide variety of organisms, from 

1 0 mycobacteria to humans, and exhibit very different expression patterns in tissues among 
the organisms. FATP family members are expressed in prokaryotic and eukaryotic 
organisms and comprise characteristic amino acid domains or sequences which are 
highly conserved across family members. In addition, the function of the FATP gene 
family is conserved throughout evolution, as shown by the fact that the Caenorhabditis 

15 (C). elegans and mycobacterial FATPs described herein facilitate LCFA uptake when 
they are overexpressed in COS cells ox Escherichia (E.) coli, respectively. FATPs are 
expressed in a wide variety of tissues, including all tissues which are important to fatty 
acid metabolism (uptake and processing). 

In specific embodiments, FATPs of the present invention are from such diverse 

2 0 organisms as humans (Homo (H.) sapiens), mice, (Mus (M.) musculus), F. rubripes, C. 
elegans, Drosophila (D.) melanogaster, Saccharomyces (S.) cerevisiae, Aspergillus 
nidulans, Cochliobolu heterostrophus, Magnaporthe grisea and Mycobacterium (M.), 
such as M. tuberculosis. As described herein, four novel mouse FATPs, referred to as 
mmFATP2, mmFATP3, mmFATP4 and mmFATP5, and six human FATPs, referred to 

25 as hsFATP 1 , hsF ATP2, hsFATP3, hsFATP4, hsFATP5 and hsFATP6, have been 
identified. All four novel murine FATPs (mmFATP2-5) and a previously identified 
murine FATP (renamed herein FATP1) have orthologs in humans (hsFATPl-5); the 
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sixth human FATP (hsFATP6) does not as yet have a mouse ortholog. The expression 
patterns of these FATPs vary, as described in detail below. 

The present invention relates to FATP family members from prokaryotes and 
eukaryotes, nucleic acids (DNA, RNA) encoding FATPs, and nucleic acids which are 
5 useful as probes or primers (e.g., for use in hybridization methods, amplification 
methods) for example, in methods of detecting FATP-encoding genes, producing 
FATPs, and purifying or isolating FATP-encoding DNA or RNA. Also the subject of 
this invention are antibodies (polyclonal or monoclonal) which bind an FATP or 
FATPs; methods of identifying additional FATP family members (for example, 
1 0 orthologs of those FATPs described herein by amino acid sequence) and variant alleles 
of known FATP genes; methods of identifying compounds which bind to an FATP, or 
modulate or alter (enhance or inhibit) FATP function; compounds which modulate or 
alter FATP function; methods of modulating or altering (enhancing or inhibiting) FATP 
function and, thus, LCFA uptake into tissues of a mammal (e.g. human) by 
1 5 administering a compound or molecule (a drug or agent) which increases or reduces 
FATP activity; and methods of targeting compounds to tissues by administering a 
complex of the compound to be targeted to tissues and a component which is bound by 
an FATP present on cells of the tissues to which the compound is to be targeted. For 
example, a complex of a drug to be delivered to the liver and a component which is 
2 o bound by an FATP present on liver cells (e.g., FATP5) can be administered. 

In one embodiment, the present invention relates to modulating or altering 
(enhancing or inhibiting/reducing) LCFA uptake in the small intestine and, thus, 
increasing or reducing the number of calories in the form of fats available to an 
individual. In another embodiment, the present invention relates to inhibiting or 
2 5 reducing LCFA uptake in the small intestine in order to reduce circulating fatty acid 
levels; that is, LCFA uptake in the small intestine is reduced and, therefore, circulating 
(blood) levels are not as high as they otherwise would be. FATP4 has been shown to be 
expressed in epithelial cells of the small intestine and particularly in the brush border 
layer of the small intestine. FATP2 has also been shown to be expressed at low levels 
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in epithelial cells of the small intestine, particularly in the duodenum. In contrast, 
FATP1, FATP3, FATP5 and FATP6 were not detected in any of the intestinal tissues. 
Thus, also described herein are FATPs which are present in the epithelial cell layer of 
the small intestine where they mediate LCFA uptake. These FATPs, particularly 
5 FATP4 and also FATP2, are targets for methods and drugs which block their function 
or activity and are useful in treating obesity, diabetes and heart disease. The ability of 
these FATPs to mediate fat uptake can be modulated or altered (enhanced or inhibited), 
thus modulating fat uptake in the small intestine. This can be done, for example, by 
administering to an individual, such as a human or other animal, a drug which blocks 
1 0 interaction of LCFAs with FATP4 and/or FATP2 in the small intestine, thus inhibiting 
LCFA passage into the cells of the small intestine. As a result, fat absorption is reduced 
and, although the individual has consumed a certain quantity of fat, the LCFAs are not 
absorbed to the same extent they would have been in the absence of the compound 
administered. 

1 5 Thus, one embodiment of this invention is a method of reducing LCFA uptake 

(absorption) in the small intestine and, as a result, reducing caloric uptake in the form of 
fat. A further embodiment is a compound (drug) useful in inhibiting or reducing fat 
absorption in the small intestine. In another embodiment, the invention is a method of 
reducing circulating fatty acid levels by administering to an individual a compound 

2 0 which blocks interactions of LCFAs with FATP4 and/or FATP2 in the small intestine, 
thus inhibiting LCFA passage into cells of the small intestine. As a result, fatty acids 
pass into the circulatory system at a diminished level and/or rate, and circulating fatty 
acid levels are lower than they would be in the absence of the compound administered. 
This method is particularly useful for therapy in individuals who are at risk for or have 

2 5 hyperlipidemia. That is, it can be used to prevent the occurrence of elevated levels of 
lipids in the blood or to treat an individual in whom blood lipid levels are elevated. 
Also the subject of this invention is a method of identifying compounds which alter 
FATP function (and thus, in the case of FATP2 and/or FATP4, alter LCFA uptake in 
the small intestine). 



In another embodiment, the present invention relates to a method of modulating 
or altering (enhancing or inhibiting) the function of FATP6, which is expressed at high 
levels in the heart. A method of inhibiting FATP6 function is useful, for example, in 
individuals with heart disease, such as ischemia, since reducing LCFA uptake into heart 
muscle in an individual who has ischemic heart disease, which may be manifested by, 
for example, angina or heart attack, can reduce symptoms or reduce the extent of 
damage caused by the ischemia. In this embodiment, a drug which inhibits FATP6 
function is administered to an individual who has had or is having a heart attack, to 
reduce LCFA uptake by the individual's heart and, as a result, reduce the damage caused 
by ischemia. In a further embodiment, this invention is a method of targeting a 
compound, such as a therapeutic drug or an imaging reagent, to heart tissue by 
administering to an individual (e.g., a human) a complex of the compound and a 
component (e.g., a LCFA or LCFA-like compound) which is bound by an FATP (e.g., 
FATP6) present in cells of heart tissue. 

In a further embodiment, LCFA uptake by the liver is modulated or altered 
(enhanced or reduced), in an individual. For example, a drug which inhibits the 
function of an FATP present in liver (e.g., FATP5) is administered to an individual who 
is diabetic, in order to reduce LCFA uptake by liver cells and, thus reduce insulin 
resistance. 

The present invention, thus, provides methods which are useful to alter, 
particularly reduce, LCFA uptake in individuals and, as a result, to alter (particularly 
reduce), availability of the LCFAs for further metabolism. In a specific embodiment, 
the present invention provides methods useful to reduce LCFA uptake and, thus, fatty 
acid metabolism in individuals, with the result that caloric availability from fats is 
reduced, and circulating fatty acid levels are lower than they otherwise would be. These 
methods are useful, for example, as a means of weight control in individuals, (e.g., 
humans) and as a means of preventing elevated serum lipid levels or reducing serum 
lipid levels in humans. FATPs expressed in the small intestine, such as FATP4, are 
useful targets to be blocked in treating obesity (e.g., chronic obesity) or to be enhanced 
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in treating conditions in which enhanced LCFA uptake is desired (e.g., malabsorption 
syndrome or other wasting conditions). 

The identification of this evolutionarily conserved fatty acid transporter family 
will allow a better understanding of the mechanisms whereby LCFAs traverse the lipid 
5 bilayer as well as yield insight into the control of energy homeostasis and its 
dysregulation in diseases such as diabetes and obesity. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the amino acid sequence alignment of FATPs: mmFATPl (SEQ 
ID NO:92), mmFATP2 (SEQ ID NO:93), mmFATP3 (SEQ ID NO:94), mmFATP4 

1 0 (SEQ ID NO:95), mmFATP5 (SEQ ID NO:96), ceFATPa (SEQ ID NO:97), scFATP 
(SEQ ID NO:98) and mtFATP (SEQ ID NO:99). The underlining (amino acid residues 
204-212 of mtFATP) indicates an AMP binding motif which is found in many classes 
of proteins; the underhning at amino acid residues 204-507 of the mtFATP sequence 
indicates the FATP 360 amino acid signature sequence. 

15 Figures 2A-2D show results of LCFA uptake assays. Figures 2 A-2D: COS 

cells were cotransfected using the DEAE-dextran method with the mammalian 
expression vectors pCDNA-CD2 either alone (control; Figure 2A) or in combination 
with one of the FATP-containing expression vectors (pCDNA-mmFATP 1 , Figure 2B; 
pCDNA-mmFATP2, Figure 2C; or pCMV-SPORT2-mmFATP5, Figure 2D) as 

2 0 described in Materials and Methods for Example 2. COS cells were gated on forward 
scatter (FSC) and side scatter (SS), and the results shown represent >1 0,000 cells. Cells 
exhibiting >300 CD2 fluorescence units (vertical line) representing 15% of all cells 
were deemed CD2 positive. 

Figure 3 is a graph of fluorescence of cells expressing a FATP gene. As in 

2 5 Figures 2A-2D, COS cells were cotransfected with pCDNA-CD2 either alone (control) 
or in combination with one of the FATP-containing expression vectors (pCDNA- 
mmFATPl, pCDNA-mmFATP2, pCMV-SPORT2-mmFATP5, or pCDNA-ceFATPb). 
The mean BODIPY-FA fluorescence of the CD2-positive cells is plotted; results shown 
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represent the average of three experiments, each consisting of greater than 50,000 COS 

cells. Note that a logarithmic scale is used on the ordinate. 

Figure 4 is a graph of the uptake of palmitate with time. The full-length coding 

region of mtFATP (squares) or a control protein (TFE3; circles) was suhcloned into the 
5 inducible, prokaryotic expression vector pET (Novagen). Expression from the resulting 

plasmid was induced (solid symbols) in transformed E. coli cells with 1 mM isopropyl- 

p-D-thiogalactoside (IPTG) for 1 hour, or cells were left uninduced (open symbols). 

Data points were done in triplicate and counts were normalized to the number of 

bacteria as determined by OD^. 
1 0 Figure 5 is a phylogenetic tree produced by aligning complete and partial 

sequences for FATP genes from human, rat, mouse, puffer fish, D. melanogaster, C. 

elegans, S. cerevisiae, and M. tuberculosis using ClustalX and using these data to 

produce a phylogenetic tree using TreeViewPPC. The bar indicates the number of 

substitutions per residue, i.e., 0.1 corresponds to a distance of 10 substitutions per 100 
15 residues. 

Figure 6 shows a comparison of the FATP signature sequences of mmFATPl 
(SEQ ID NO:l), mmFATP5, (SEQ ID NO:2), ceFATPa (SEQ ID NO:3), scFATP (SEQ 
ID NO:4) and mtFATP (SEQ ID NO:5). 

Figure 7 shows the sequence identity among the FATP family members and 
2 0 VLACs, based on the 360 amino acid signature sequence of FATP from Figure 1 . 

Figures 8A and 8B are the mmFATP3 DNA sequence (SEQ ID NO:6). 

Figure 9 is the mmFATP3 protein sequence (SEQ ID NO:7). 

Figures 10A and 10B are the mmFATP4 DNA sequence (SEQ ED NO:8). 

Figure 1 1 is the mmFATP4 protein sequence (SEQ ID NO:9). 
2 5 Figures 12A and 12B are the mmFATP5 DNA sequence (SEQ LD NO: 1 0). 

Figure 13 is the mmFATP5 protein sequence (SEQ ID NO: 11). 

Figures 14A and 14B are the hsFATP2 DNA sequence (SEQ ED NO: 12). 

Figure 15 is the hsFATP2 protein sequence (SEQ ID NO: 13). 

Figures 16A and 16B are the hsFATP3 DNA sequence (SEQ ID NO: 14). 
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Figure 17 is the hsFATP3 protein sequence (SEQ ID NO: 15). 
Figures 18A and 18B are the hsFATP4 DNA sequence (SEQ ID NO: 16). 
Figure 19 is the hsFATP4 protein sequence (SEQ ID NO: 17). 
Figures 20A and 20B are the hsFATP5 DNA sequence (SEQ ID NO: 18). 
5 Figure 21 is the hsFATP5 protein sequence (SEQ ID NO:19). 

Figures 22A and 22B are the hsFATP6 DNA sequence (SEQ ID NO:20). 
Figure 23 is the hsFATP6 protein sequence (SEQ ID NO:21). 
Figures 24A and 24B are the mtFATP DNA sequence (SEQ ID NO:22). 
Figure 25 is the mtFATP protein sequence (SEQ ID NO:23). 
1 0 Figure 26 shows the DNA sequence (SEQ ID NO:24) and predicted amino acid 

sequence (SEQ ID NO:25) of human FATP1 . 

Figure 27 shows the DNA sequence (SEQ ID NO:26) and predicted amino acid 
sequence (SEQ ID NO:27) of human FATP4. 

Figure 28A is a hydrophobicity plot for hsFATPl, showing that it has multiple 
15 membrane-spanning domains. 

Figure 28B is the amino acid composition of hsFATPl. 
Figure 28C is a hydrophilicity plot for hsFATPl, made using the Kyte-Doolittle 
method, averaging hydrophilicity values for 18 amino acid residues at a time. 

Figure 29A is a hydrophobicity plot for hsFATP4, showing that it has multiple 
2 0 membrane-spanning domains. 

Figure 29B is a listing of the amino acid composition of hsFATP4. 
Figure 29C is a hydrophilicity plot for hsFATP4, made using the Kyte-Doolittle 
method, averaging hydrophilicity values for 18 amino acid residues at a time. 

Figures 30A and 30B show a comparison of the nucleotide sequence of human 
2 5 FATP1 (SEQ ID NO:28) and the nucleotide sequence of mouse FATP1 (SEQ ID 
NO:29). 

Figures 31 A and 3 IB show a comparison of the nucleotide sequence of human 
FATP4 (SEQ ID NO:30) and the nucleotide sequence of mouse FATP4 (SEQ ID 
NO:31). 
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Figure 32 shows a comparison of the amino acid sequence of human FATP1 
(SEQ ID NO:32) and the amino acid sequence of mouse FATP1 (SEQ ID NO:33). 
Shaded amino acid residues match the concensus sequence exactly 

Figure 33 shows a comparison at the amino acid level of human FATP4 (SEQ 
5 ID NO:34) and mouse FATP4 (SEQ ID NO:35). Shaded amino acid residues match the 
concensus sequence exactly. 

Figure 34 shows the nucleotide sequence (SEQ ID NO:36) and predicted amino 
acid sequence (SEQ ID NO:37) of hsFATP6. 

Figure 35A is a hydrophobicity plot for hsFATP6, showing that it has multiple 
10 membrane-spanning domains. 

Figure 35B is a listing of the amino acid composition of hsFATP6. 

Figure 35C is a hydrophilicity plot for hsFATP6, made using the Kyte-Doolittle 
method, averaging hydrophilicity values for 18 amino acid residues at a time. 

Figure 36 shows an alignment of the amino acid sequences of hsFATPl (SEQ 
15 ID NO:38), hsFATP4 (SEQ ID NO:39) and hsFATP6 (SEQ ID NO:40). Shaded amino 
acid residues match the concensus sequence exactly. 

Figure 37 shows results of assessment of fatty acid uptake by human FATP1 and 
human FATP4. The percent of CD2-positive cells exhibiting a BODIPY-fluorescence 
of more than 300 arbitrary units is plotted for the three different conditions tested. 
2 0 Figure 38 is a graph showing uptake of tritiated oleate, with time, by 293 cells 

transfected with either (diamonds) a plasmid for expression of human FATP4 or 
(squares) a control plasmid. 

Figure 39 is an illustration of the amino acid sequences of human FATP4 (SEQ 
ID NO:41) and mouse FATP4 (SEQ ID NO:42) compared to human FATP1 (SEQ ID 
2 5 NO:43). Shown by underlining are the FATP consensus sequence (236-556 of 
hsFATPl) and the AMP-binding motif (246-254 of hsFATPl). The human FATPs 
were cloned by screening libraries with sequences from ESTs (expressed sequence 
tags). Mouse FATP4 was cloned by PCR using degenerate primers. 
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Figure 40 is a graph showing the uptake, with time, of tritiated oleate by mouse 
enterocytes in the presence of no oligonucleotide (squares), sense oligonucleotide 
(circles) or antisense oligonucleotide (diamonds). 

Figure 41 is a bar graph showing uptake of tritiated oleate, by mouse enterocytes 
5 in the presence of various concentrations of antisense (solid bars), mismatch (stippled 
bars) or sense (lined bars) oligonucleotides. 

Figure 42 is a bar graph showing uptake of tritiated oleate and uptake of 35 S- 
labeled methionine by mouse enterocytes to which were added no oligonucleotide, the 
antisense oligonucleotide, or the mismatch oligonucleotide. 
1 0 Figure 43 A is the nucleotide sequence of the gene encoding mouse FATP4 (SEQ 

ID NO:44). 

Figure 43B is the amino acid sequence of mouse FATP4 protein (SEQ ID 
NO:45). 

Figures 44A, 44B, and 44C are the hsFATPl DNA sequence (SEQ ID NO:46). 
1 5 Coding region: 175-21 15 (1941 nt). 

Figure 45 is the hsFATPl protein sequence (SEQ ID NO:47). 
Figures 46A and 46B are the hsFATP2 DNA sequence (SEQ ID NO:48). 
Coding region: 223-2085 (1863 nt). 

Figure 47 is the hsFATP2 protein sequence (SEQ ID NO:49). 
2 0 Figure 48 is the partial DNA sequence of hsFATP3 (SEQ ID NO:50). Coding 

region: 1-993. 

Figure 49 is the partial protein sequence of hsFATP3 (SEQ ID NO:51). 
Figures 50A, 50B, and 50C are the hsFATP4 DNA sequence (SEQ ID NO:52). 
Coding region: 208-2139 (1932 nt). 
2 5 Figure 5 1 is the hsFATP4 protein sequence (SEQ ID NO:53). 

Figure 52 is the hsFATP5 partial DNA sequence (SEQ ID NO:54). Coding 
region: 1-1062. 

Figure 53 is the hsFATP5 partial protein sequence (SEQ ID NO:55). 
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Figures 54A, 54B, and 54C are the hsFATP6 DNA sequence (SEQ ID NO:56). 
Coding region: 643-2502 (1860 nt). 

Figure 55 is the hsFATP6 protein sequence (SEQ ID NO:57). 

Figures 56A, 56B, and 56C are the rnFATPl DNA sequence (m=Rattus 
norvegicus; (SEQ ID NO:58). Coding region: 75-2015 (1941 nt). 

Figure 57 is the rnFATPl protein sequence (SEQ ID NO:59). 

Figure 58A, 58B, and 58C are the rnFATP2 DNA sequence (SEQ ID NO:60). 
Coding region: 795-2657 (1863 nt). 

Figure 59 is the rnFATP2 protein sequence (SEQ ID NO:61). 

Figure 60A and 60B are the rnFATP4 partial DNA sequence (SEQ ID NO:62). 
Coding region: 1-1218. 

Figure 61 is the rnFATP4 partial DNA sequence (SEQ ID NO:63). 

Figure 62A, 62B, and 62C are the mmFATPl DNA sequence (SEQ ID NO:64). 
Coding region: 1-1944. 

Figure 63 is the mmFATPl protein sequence (SEQ ID NO:65). 

Figures 64A and 64B are the mmFATP2 DNA sequence (SEQ ID NO:66). 
Coding region: 121-1992 (1872 nt). 

Figure 65 is the mmFATP2 protein sequence (SEQ ID NO:67). 

Figures 66A and 66B are the mmFATP3 partial DNA sequence (SEQ ID 
NO:68). Coding region: 1-1830. 

Figure 67 is the mmFATP3 partial protein sequence (SEQ ID NO:69). 

Figures 68A, 68B, and 68C are the mmFATP4 DNA sequence (SEQ ID NO:70). 
Coding region: 1-1932. 

Figures 69 is the mmFATP4 protein sequence (SEQ ID NO:71). 

Figures 70A and 70B are the mmFATP5 DNA sequence (SEQ ID NO:72). 
Coding region: 60-2129. 

Figure 71 is the mmFATP5 protein sequence (SEQ ID NO:73). 

Figures 72A and 72B are the dmFATP partial DNA sequence (<bn=Drosophila 
melanogaster; SEQIDNO:74). Coding region: 1-1773. 
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Figures 73 is the dmFATP partial protein sequence (SEQ ID NO:75). 
Figure 74 is the drFATP partial DNA sequence (dx=Danio rerio, zebrafish; SEQ 
IDNO:76) Coding region: 1-173. 

Figure 75 is the drFATP partial protein sequence (SEQ ID NO:77). 
5 Figure 76A and 76B are the ceFATPa DNA sequence (SEQ ID NO:78). Coding 

region: 1-1953. 

Figure 77 is the ceFATPa protein sequence (SEQ ID NO:79). 
Figures 78A and 78B are the ceFATPb DNA sequence (SEQ ID NO:80). 
Coding region: 1-1968. 
0 Figure 79 is the ceFATPb protein sequence (SEQ ID NO: 8 1). 

Figures 80A and 80B are the chFATP DNA sequence (SEQ ID NO:82; 
ch=Cochliobolu heterostrophns). Coding region: 1-1932. 

Figure 81 is the chFATP protein sequence (SEQ ID NO:83). 
Figure 82 is the anFATP partial protein sequence (an=Aspergillus nidulans; 
5 SEQ ID NO:84). Coding region: 1-597. 

Figure 83 is the anFATP partial protein sequence (SEQ ID NO:85). 
Figure 84 is the mgFATP partial DNA sequence (mg= Magnaporthe grisea, rice 
blast; SEQ ID NO:86). Coding region: 1-522. 

Figure 85 is the mgFATP partial protein sequence (SEQ ID NO:87). 
0 Figures 86A and 86B are the scFATP DNA sequence (SEQ ID NO:88). Coding 

region: 1-1872. 

Figure 87 is the scFATP protein sequence (SEQ ID NO:89). 

Figures 88A and 88B are the mtFATP DNA sequence (SEQ ID NO:90). 

Figure 89 is the mtFATP protein sequence (SEQ ED NO:91). Coding region: 1- 

5 1794. 

Figure 90 is a concensus sequence of the FATP signature sequence (SEQ ID 
NO:100), based on 23 independent sequences aligned in ClustalX. The height of the bar 
at each amino acid residue position indicates the degree of conservation at that position. 
Gaps have been inserted to maintain the strength of the alignment. 
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Figure 91 is a hydrophilicity plot for hsFATP2, made using the Kyte-Doolittle 
method, averaging hydrophilicity values for 18 amino acid residues at a time. 

Figure 92 is a hydrophilicity plot for the hsFATP3 partial protein, made using 
the Kyte-Doolittle method, averaging hydrophilicity values for 18 amino acid residues 
5 at a time. 

Figure 93 is a hydrophilicity plot for the hsFATP5 partial protein, made using 
the Kyte-Doolittle method, averaging hydrophilicity values for 18 amino acid residues 
at a time. 

Figures 94A and 94B are a representation of the DNA sequence (SEQ ID 
10 NO:101) of the hsFATP3 gene, and the amino acid sequence (SEQ ID NO:102) of the 
□ hsFATP3 protein. 

5 Figure 95. Mammalian expression constructs containing either hsFATP4 

% (squares and triangles) or empty control vector (circles) were stably transfected into 293 

J! cells - Short-term uptake of Bodipy-FA in the presence of BSA was determined by 

Z 15 FACS. The mean fluorescence of the viable cell population is expressed in arbitrary 

^ fluorescence units. FATP4 protein expression was determined by densitometry of anti- 

M FATP4 Western blots, and is expressed in arbitrary units. 

J Figure 96. Short-term uptake of Bodipy-palmitate (1 \lM), either by control 

0 cells (black bars) or FATP4-expressing cells (hatched bars), was measured in the 

2 0 presence of 0, 1 0, 1 00 |iM unlabeled palmitate. FA uptake was quantified by FACS 
and expressed in arbitrary fluorescence units. 

Figure 97. The rate of [ 2 H]palmitate uptake by 293 cells, which were stably 
transfected with a construct for either human FATP4 (diamonds) or an empty vector 
(circles), was compared to that of isolated enterocytes (squares). 
2 5 Figure 98. Isolated enterocytes were incubated for 48h with increasing 

concentrations of the FATP4 antisense oligonucleotide or with 100 |lM of a randomized 
control oligonucleotide with identical nucleotide composition to the FATP4 antisense 
oligonucleotide. The uptake of oleate by the enterocytes was then measured over a 5 
min time interval (solid bars). In parallel, the levels of FATP4 protein and, as a loading 
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control, P-catenin, were determined by Western blotting and quantitated using 
densitometry (hatched bars). FA uptake and FATP4 protein levels were normalized to 
that of untreated cells. The averages and standard deviations of 4 independent 
experiments are shown. 
5 Figure 99. Uptake rates of [ 3 H]oleate, [ 3 H]palmitate and [ 35 S]methionine by 

primary enterocytes were measured after 48h incubation with either 100 |lM FATP4 
antisense (solid bars) or 100 |lM randomized control oligonucleotide (hatched bars) 
and expressed as % of untreated cells. 

DETAILED DESCRIPTION OF THE INVENTION 

10 As described herein, FATPs are a large evolutionarily conserved family of 

proteins that mediate the transport of LCFAs into cells. The family includes proteins 
which are conserved from mycobacteria to humans and exhibit very different expression 
patterns in tissues. Specific embodiments described include FATPs from mice, humans, 
nematodes, fungi and mycobacteria which have been shown to be functional LCFA 

15 transporters. The term "fatty acid transport proteins" ("FATPs") as used herein, refers 
to the proteins described herein as FATP1, FATP2, FATP3, FATP4, FATPS and 
FATP6, which have been described in one or more species of mammals, as well as 
mtFATP, ceFATP, scFATP, anFATP, mgFATP, and chFATP, and other proteins 
sharing at least about 50% amino acid sequence similarity, preferably at least about 60% 

2 0 sequence similarity, more preferably at least about 70% sequence similarity, and still 
more preferably, at least about 80% sequence similarity, and most preferably, at least 
about 90% sequence similarity in the approximately 360 amino acid signature sequence. 
The approximaely 360 amino acid FATP signature sequence is shown in Figure 1. The 
concensus sequence of the signature sequence is shown in Figure 90. The nomenclature 

2 5 used herein to refer to FATPs includes a species-specific prefix (e.g., mm, Mus 

musculus; hs or h, Homo sapiens or human; mt M tuberculosis; dm. D. melanogaster; 
ce, C elegans; sc, Saccharomyces cerevisiae) and a number such that mammalian 
homologues in different species share the same number. For example, six human and 
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five mouse FATP genes which are expressed in a variety of tissues are described herein 
and are referred to, respectively, as hsFATP 1 -hsFATP6 and mmFATPl-mmFATP5; for 
example, hsFATP4 and mmFATP4 are the human and mouse orthologs. 

Expression patterns of human and mouse FATPs have been assessed and are 
described below. Briefly, results of these assessments show that FATP5 is a liver- 
specific gene. FATP2 is highly expressed in liver and kidney. Both of these proteins, 
as well as FATP4 and FATPs from nematodes and mycobacteria, have been shown to 
be functional LCFA transporters. Results have also shown that FATP4 mRNA is 
present at high levels in epithelial cells of two regions of the small intestine (the 
jejunum and ileum) and at lower, but significant, levels in a third region (the 
duodenum). They further showed that FATP2 mRNA is present in epithelial cells of the 
duodenum at a level similar to that of FATP4 mRNA levels, but is present at lower 
levels in the jejunum and ileum. FATP4 mRNA was absent from other cell types of the 
small intestine and no FATP4 mRNA could be detected in any cells of the colon. No 
signals above background could be detected for FATP1, FATP3 and FATP5 in any of 
the intestinal tissues. Thus, FATP4 is the major FATP in the mouse small intestine, 
which supports a major role for FATP4 (along with FATP2 to a lesser extent) in 
absorption of free fatty acids. hsFATP4 was clearly expressed in the jejunum and 
ileum; expression was absent in the stomach. This, too, is consistent with a major role 
for FATP4 in absorption of fatty acids in the human gut. Analysis of FATP expression 
in human tissues, also described in detail below, showed that hsFATP6, which has no 
mouse ortholog as yet, is expressed at high levels in the heart and at low levels in the 
placenta, but is undetectable in the other tissues assessed (Example 9). This is 
consistent with a major role for FATP6 in absorption of fatty acids in the heart. 

Long chain fatty acids (LCFAs) are an important energy source for pro- and 
eukaryotes and are involved in diverse cellular processes, such as membrane synthesis, 
intracellular signaling, protein modification, and transcriptional regulation. In 
developed Western countries, human dietary lipids are mainly di- and triglycerides and 
account for approximately 40% of caloric intake (Weisburger, J. H. (1997) J. Am. Diet. 
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Assoc. P7:S16-S23). These lipids are broken down into fatty acids and glycerol by 
pancreatic lipases in the small intestine (Chapus, C, Rovery, M., Sarda, L & Verger, R. 
(1988) Biochimie 70:1223-34); LCFAs are then transported into brush border cells, 
where the majority is re-esterified and secreted into the lymphatic system as 
chylomicrons (Green, P.H. & Riley, J.W. (1981) AusL N.Z.J. Med. 77:84-90). Fatty 
acids are liberated from lipoproteins by the enzyme lipoprotein lipase, which is bound to 
the luminal side of endothelial cells (Scow, R.O. & Blachette-Mackie, E J. (1992) Mol 
Cell. Biochem 776:181-191). "Free" fatty acids in the circulation are bound to serum 
albumin (Spector, A.A. (1984) Clin. Physiol Biochem 2:123-134) and are rapidly 
incorporated by adipocytes, hepatocytes, and cardiac muscle cells. The latter derive 60- 
90% of their energy through the oxidation of LCFAs (Neely, J.F. Rovetto, M.J. & 
Oram, J.F. (1972) Prog. Cardiovasc. Dis: 75:289-329). Although saturable and specific 
uptake of LCFAs has been demonstrated for intestinal cells, hepatocytes, cardiac 
myocytes, and adipocytes, the molecular mechanisms of LCFA transport across the 
plasma membrane have remained controversial (Hui, T.Y. & Bernlohr, D.A. (1997) 
Front Bioscl 75:d222-31-d231; Schaffer, J.E. & Lodish, H.F, (1995) Trends 
Cardiovasc. Med. 5:218-224). Described herein is a large family of highly homologous 
mammalian LCFA transporters which show wide expression, including in all tissues 
relevant to fatty acid metabolism. Further described are novel members of this family 
in other species, including mycobacterial and nematode FATPs which, like their 
mammalian counterparts, are functional fatty acid transporters. 

The discovery of a diverse but highly homologous family of FATPs is 
reminiscent of the glucose transporter family. In a manner similar to the FATPs, the 
glucose transporters have very divergent patterns of tissue expression (McGowan, KJVL, 
Long, S.D. & Pekala, P.H. (1995) Pharmacol Ther. 65:465-505). The FATPs, like 
glucose transporters, may also differ in their substrate specificities, uptake kinetics, and 
hormonal regulation (Thorens, B. (1996) Am. J. Physiol 270:G541-G553). Indeed, the 
levels of fatty acids in the blood, like those of glucose, can be regulated by insulin and 
are dysregulated in diseases such as noninsulin-dependent diabetes and obesity (Boden, 
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G. (1997) Diabetes 46:3-10). The underlying mechanisms for the regulation of free 
fatty acid concentrations in the blood are not understood, but could be explained by 
hormonal modulation of FATPs. 

Insulin-resistance is thought to be the major defect in non insulin-dependent 
diabetes mellitus (NIDDM) and is one of the earliest manifestations of NIDDM 
(McGarry (1992) Science 258:766-770). Free fatty acids (FFAs) may provide an 
explanation for why obesity is a risk factor for NIDDM. Plasma levels of FFAs are 
elevated in diabetic patients (Reaven et al (1988) Diabetes 37: 1020). Elevated plasma 
free fatty acids (FFAs) have been demonstrated to induce insulin-resistance in whole 
animals and humans (Boden (1998) Front. Biosci. 3:D169-D175). This insulin- 
resistance is likely mediated by effects of FFAs on a variety of issues. FFAs added to 
adipocytes in vitro induce insulin resistance in this cell type as evidenced by inhibition 
of insulin-induced glucose transport (Van Epps-Fung et al (1997) Endocrinology 
138:4338-4345). Rats fed a high fat diet developed skeletal muscle insulin resistance as 
evidenced by a decrease in insulin-induced glucose uptake by skeletal muscle (Han et 
al, (1997) Diabetes 46:1761-1767). In addition, elevated plasma FFAs increase 
insulin-suppressed endogenous glucose production in the liver (Boden (1998) Front. 
Biosci 3:D169-D175), thus increasing hepatic glucose output. It has been postulated 
that the adverse effects of plasma free fatty acids are due to the FFAs being taken up 
into the cell, leading to an increase in intracellular long chain fatty acyl CoA; 
intracellular long chain acyl CoAs are thought to mediate the effects of FFAs inside the 
cell. Thus, fatty acid induced insulin-resistance may be prevented by blocking uptake 
of FFAs into select tissues, in particular liver (by blocking FATP2 and/or FATPS), 
adipocyte (by blocking FATP1), and skeletal muscle (by blocking FATP1). Blocking 
intestinal fat absorption (by blocking FATP4) is also expected to reduce plasma FFA 
levels and thus improve insulin resistance. 

During the pathogenesis of NIDDM insulin-resistance can initially be 
counteracted by increasing insulin output by the pancreatic beta cell. Ultimately, this 
compensation fails, beta cell function decreases and overt diabetes results (McGarry 
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(1992) Science 258: 766-770). Manipulating beta cell function is a second point where 
fatty acid transporter blockers may be beneficial for diabetes. While no FATP homolog 
has been identified so far that is expressed in the beta cell of the pancreas, the data 
described below suggest the existence of such a transporter and the sequence 
5 information included herein provides the means to identify such a transporter by 

degenerate PCR, using primers to regions conserved in all FATP family members or by 
low stringency hybridization. It has been demonstrated that exposure of pancreatic 
beta-cells to FFAs increases the basal rate of insulin secretion; this in turn leads to a 
decrease in the intracellular stores of insulin, resulting in decreased capacity for insulin 

10 secretion after chronic exposure (Bollheimer et al, (1998) J. Clin. Invest 101:1094- 
1 101). The effects of FFAs are again likely to be mediated by intracellular long chain 
fatty acylCoA molecules (Liu et al, (1998) J. Clin. Invest 101:1870-1875). FFAs 
have also been demonstrated to increase beta cell apoptosis (Shimabukuro et al, (1998) 
Proc. Nat Acad. Set USA 95:2498-2502), possibly contributing to the decrease in beta 

1 5 cell numbers in late stage NIDDM. 

Another finding with potentially broad implications is the identification of a 
FATP homologue in M. tuberculosis. Tuberculosis causes more deaths worldwide than 
any other infectious agent and drug-resistant tuberculosis is re-emerging as a problem in 
industrialized nations (Bloom, B.R. & Small, P.M. (1998) N. Engl J. Med. 338:611- 

2 0 678). Mycobacterium tuberculosis has about 250 enzymes involved in fatty acid 

metabolism, compared with only about 50 in E. colt It has been suggested that, living 
as a pathogen, the mycobacteria are largely lipolytic, rather than lipogenic, relying on 
the lipds within mammalian cells and the tubercle (Cole, S.T. et at, Nature 393:531- 
544 (1998)). The de novo synthesis of fatty acids in Mycobacterium leprae is 

2 5 insufficient to maintain growth (Wheeler, P.R., Bulmer, K & Ratledge, C. (1990) 1 
Gene. Microbiol 136:21 1-217). Thus, it is reasonable to expect that inhibitors of 
mtFATP will serve as therapeutics for tuberculosis. FATPs expressed in mycobacteria 
can be targeted to reduce or prevent replication of mycobacteria (e.g., to reduce or 
prevent replication of M. tuberculosis) and, thus, reduce or prevent their adverse effects. 
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For example, a FATP or FATPs expressed by M. tuberculosis can be targeted and 
inhibited, thus reducing or preventing growth of this pathogen (and tuberculosis in 
humans and other mammals). An inhibitor of an M. tuberculosis FATP can be 
identified, using methods described herein (e.g., expressing the FATP in an appropriate 
host cell, such as E. coli or COS cells; contacting the cells with an agent or drug to be 
assessed for its ability to inhibit the FATP and, as a result, mycobacterial growth, and 
assessing its effects on growth). A drug or agent identified in this manner can be further 
tested for its ability to inhibit a M tuberculosis FATP and M. tuberculosis infection in 
an appropriate animal model or in humans. A method of inhibiting mycobacterial 
growth, particularly growth of M tuberculosis, and compounds useful as drugs for 
doing so are also the subject of this invention. 

An isolated polynucleotide encoding mtFATP, like other polynucleotides 
encoding FATPs of the FATP family, can be incorporated into vectors, nucleic acids of 
viruses, and other nucleic acid constructs that can be used in various types of host cells 
to produce mtFATP. This mtFATP can be used, as it appears on the surface of cells, or 
in various artificial membrane systems, to assess fatty acid transport function, to 
identify ligands and molecules that are modulators of fatty acid transport activity. 
Molecules found to be inhibitors of mtFATP function can be incorporated into 
pharmaceutical compositions to administer to a human for the treatment of tuberculosis. 

Particular embodiments of the invention are polynucleotides encoding a FATP 
of Cochliobolus (Helminthosporium) heterostrophus or portions or variants thereof, the 
isolated or recombinantly produced FATP, methods for assessing whether an agent 
binds to the chFATP, and further methods for assessing the effect of an agent being 
tested for its ability to modulate fatty acid transport activity. Cochliobolus 
heterostrophus is an ascomycete that is the cause of southern corn leaf blight, an 
economically important threat to the corn crop in the United States. The related species 
C. sativus causes crown rot and common root rot in wheat and barley. One or more 
FATPs of C. heterostrophus can be targeted for the identification of an inhibitor of 
chFATP function, which can be then be used as an agent effective against infection of 
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plants by C. heterostrophus and related organisms. Methods described herein that were 
applied in studying the expression of a F ATP gene and the function of the FATP in its 
natural site of expression or in a host cell, can be used in the study of the chFATP gene 
and protein. 

5 Magnaporthe grisea (rice blast) is an economically important fungal pathogen of 

rice. Further embodiments of the invention are nucleic acid molecules encoding a 
FATP of Magnaporthe grisea, portions thereof, or variants thereof, isolated mgFATP, 
nucleic acid constructs, and engineered cells expressing mgFATP. Other aspects of the 
invention are assays to identify an agent which binds to mgFATP and assays to identify 
10 an agent which modulates the function of mgFATP in cells in which mgFATP is 

expressed or in artificial membrane systems. Agents identified as inhibiting mgFATP 
activity can be developed into anti-fungal agents to be used to treat rice infected with 
rice blast. 

Caenorhabditis elegans is a nematode related to plant pathogens and human 
15 parasites. An isolated polynucleotide which encodes ceFATP, like other 

polynucleotides encoding FATPs of the FATP family described herein, can be 
incorporated into nucleic acid vectors and other constructs that can be used in various 
types of cells to produce ceFATP. ceFATP as it occurs in cells or as it can be isolated 
or incorporated into various artificial or reconstructed membrane systems, can be used 
2 0 to assess fatty acid transport, and to identify ligands and agents that modulate fatty acid 
transport activity. Agents found by such assays to be inhibitors of ceFATP activity can 
be incorporated into compositions for the treatment of diseases caused by genetically 
related organisms with a FATP of similar sensitivity to the agents. 

Aspergillus nidulans is one of a family of fungal species that can infect humans. 
2 5 Further embodiments of the invention of the family of polynucleotides encoding FATPs 
are polynucleotides encoding a FATP of Aspergillus nidulans, and vectors and host 
cells that can be constructed to comprise such polynucleotides. Further embodiments 
are a polypeptide encoded by such polynucleotides, portions thereof having one or more 
functions characteristic of a FATP, and various methods. The methods include those 
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for identifying agents that bind to anFATP and those for assessing the effect of an agent 
being tested for its ability to modulate fatty acid transport activity. Those agents found 
to inhibit fatty acid transport function can be used in compositions as anti-fungal 
pharmaceuticals, or can be modified for greater effectiveness as a pharmaceutical. 

One aspect of the invention relates to isolated nucleic acids that encode a FATP 
as described herein, such as those FATPs having an amino acid sequence in Figure 45 
(SEQ ID NO:47), Figure 47 (SEQ ID NO:49), Figures 94A and 94B (SEQ ID NO:102), 
Figure 51 (SEQ ID NO:53), Figure 53 (SEQ ID NO:55), and Figure 55 (SEQ ID 
NO: 5 7) and nucleic acids closely related thereto as described herein. 

Using the information provided herein, such as a nucleic acid sequence set forth 
in Figures 44A-44C (SEQ ID NO:46), Figures 46A and 46B (SEQ ID NO:48), Figures 
94A and 94B (SEQ ID NO:101), Figures 50A-50C (SEQ ID NO:52), Figure 52 (SEQ 
ID NO:54), and Figures 54A-54C (SEQ ID NO:56), a nucleic acid of the invention 
encoding a FATP polypeptide may be obtained using standard cloning and screening 
methods, such as those for cloning and sequencing cDNA library fragments, followed 
by obtaining a full length clone. For example, to obtain a nucleic acid of the invention, 
a library of clones of cDNA of human or other mammalian DNA can be probed with a 
labeled oligonucleotide, such as a radiolabeled oligonucleotide, preferably about 17 
nucleotides or longer, derived from a partial sequence. Clones carrying DNA identical 
to that of the probe can then be distinguished using stringent (also, "high stringency") 
hybridization conditions. By sequencing the individual clones thus identified with 
sequencing primers designed from the original sequence it is then possible to extend the 
sequence in both directions to determine the full length sequence. Suitable techniques 
are described, for example, in Current Protocols in Molecular Biology (F.M. Ausubel et 
al, eds), containing supplements through Supplement 42, 1998, John Wiley and Sons, 
Inc., especially chapters 5, 6 and 7. 

Embodiments of the invention include isolated nucleic acid molecules 
comprising any of the following nucleotide sequences: 7.) a nucleotide sequence 
which encodes a protein comprising the amino acid sequence of hsFATPl (SEQ ID 
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NO:47), the amino acid sequence of hsFATP2 (SEQ ID NO:49), the amino acid 
sequence of hsFATP3 (SEQ ID NO:51), the amino acid sequence of hsFATP4 (SEQ ID 
NO: 53), the amino acid sequence of hsFATPS (SEQ ID NO:55) or the amino acid 
sequence of hsFATP6 (SEQ ID NO:57); 2) nucleotide sequences of hsFATPl, 
5 hsFATP2, hsFATP3, hsFATP4, hsFATPS, or hsFATP6 (SEQ ID NO:46, 48, 101, 52, 
54, or 56, respectively); 3.) a nucleotide sequence which is complementary to the 
nucleotide sequence of hsFATPl (SEQ ID NO:46), hsFATP2 (SEQ ED NO:48), 
hsFATP3 (SEQ ID NO: 101), hsFATP4 (SEQ ID NO:52), hsFATPS (SEQ ID NO:54) or 
hsFATP6 (SEQ ID NO:56); 4.) a nucleotide sequence which consists of the coding 

1 0 region of hsFATPl (SEQ ID NO:46), the coding region of hsFATP2 (SEQ ID NO:48), 
the coding region of hsFATP3 (SEQ ID NO: 101), the coding region of hsFATP4 (SEQ 
ID NO:52), the coding region of hsFATP5 (SEQ ID NO:54), or the coding region of 
hsFATP6 (SEQ ID NO:56). 

The invention further relates to nucleic acids (nucleic acid molecules or 

1 5 polynucleotides) having nucleotide sequences identical over their entire length to those 
shown in the figures, for instance Figures 44A-44C (SEQ ID NO:46), Figures 46A and 
46B (SEQ ID NO:48), Figures 94A and 94B (SEQ ID NO: 101), Figures 50A-50C (SEQ 
ID NO:52), Figure 52 (SEQ ID NO:54), and Figures 54A-54C (SEQ ID NO:56). It 
further relates to DNA, which due to the degeneracy of the genetic code, encodes a 

2 0 FATP encoded by one of the FATP-encoding DNAs, whose amino acid sequence is 
provided herein. Also provided by the invention are nucleic acids having the coding 
sequences for the mature polypeptides or fragments in reading frame with other coding 
sequences, such as those encoding a leader or secretory sequence, a pre-, or pro- or 
prepro- protein sequence. The nucleic acids of the invention encompass nucleic acids 

2 5 that include a single continuous region or discontinuous regions encoding the 

polypeptide, together with additional regions, that may also contain coding or non- 
coding sequences. The nucleic acids may also contain non-coding sequences, including, 
for example, but not limited to, non-coding 5' and 3' sequences, such as the transcribed, 
non-translated sequences, termination signals, ribosome binding sites, sequences that 
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stabilize mRNA, introns, polyadenylation signals, and additional coding sequences 
which encode additional amino acids. For example, a marker sequence that facilitates 
purification of the fused polypeptide can be encoded. In certain embodiments of the 
invention, the marker sequence can be a hexa-histidine peptide, as provided in the pQE 
5 vector (Qiagen, Inc.) and described in Gentz et al, Proc. Natl Acad. Set USA 86: 821- 
824 (1989), or an HA tag (Wilson et al, Cell 37: 767 (1984)), or a sequence encoding 
glutathione S-transferase of Schistosoma japonicum (vectors available from Pharmacia; 
see Smith, D.B. and Johnson K.S., Gene 57:31 (1988) and Kaelin, W.G. et al, Cell 
70:351 (1992)). Nucleic acids of the invention also include, but are not limited to, 

1 0 nucleic acids comprising a structural gene and its naturally associated sequences that 
control gene expression. 

The invention further relates to variants, including naturally-occurring allelic 
variants, of those nucleic acids described specifically herein by DNA sequence, that 
encode variants of such polypeptides as those having the amino acid sequences shown 

15 in Figure 45 (SEQ ID NO:47), Figure 47 (SEQ ID NO:49), Figures 94A and 94B (SEQ 
ID NO:102), Figure 51 (SEQ ID NO:53) Figure 53 (SEQ ID NO:55), or Figure 55 (SEQ 
ID NO:57). Such variants include nucleic acids encoding variants of the above-listed 
amino acid sequences, wherein those variants have several, such as 5 to 10, 1 to 5, or 3, 
2 or 1 amino acids substituted, deleted, or added, in any combination. Variants include 

2 0 polynucleotides encoding polypeptides with at least 95% but less than 100% amino acid 
sequence identity to the polypeptides described herein by amino acid sequence. Variant 
polynucleotides hybridize, under low to high stringency conditions, to the alleles 
described herein by DNA sequence. In one embodiment, variants have silent 
substitutions, additions and deletions that do not alter the properties and activities of the 

25 FATP. Allelic variants of the polynucleotides encoding hsFATPl (Figure 45; SEQ ID 
NO:47), hsFATP2 (Figure 47; SEQ ID NO:49), hsFATP3 (Figures 94A and 94B; SEQ 
ID NO:102), hsFATP4 (Figure 51; SEQ ID NO:53), Figure 53 (SEQ ID NO:55) and 
hsFATP6 (Figure 55; SEQ ID NO:57) will be identified as mapping to chromosomal 
locations listed for the corresponding wild type genes in Table 2 in Example 1. 
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Orthologous genes are gene loci in different species that are sufficiently similar 
to each other in their nucleotide sequences to suggest that they originated from a 
common ancestral gene. Orthologous genes arise when a lineage splits into two species, 
rather than when a gene is duplicated within a genome. Proteins that are orthologs are 
5 encoded by genes of two different species, wherein the genes are said to be orthologous. 
The invention further relates to polynucleotides encoding polypeptides which 
are orthologous to those polypeptides having a specific amino acid sequence described 
herein, such as the amino acid sequences shown in Figure 45 (SEQ ID NO:47), Figure 
47 (SEQ ID NO:49), Figures 94A and 94B (SEQ ID NO: 102), Figure 51 (SEQ ID 

1 0 NO:53), Figure 53 (SEQ ID NO:55), or Figure 55 (SEQ ID NO:57). These 

polynucleotides, which can be called ortholog polynucleotides, encode orthologous 
polypeptides that can range in amino acid sequence identity to a reference amino acid 
sequence described herein, from about 65% to less than 100%, but preferably 70% to 
80%, more preferably 80% to 90%, and still more preferably 90% to less than 100%. 

1 5 Orthologous polypeptides can also be those polypeptides that range in amino acid 
sequence similarity to a reference amino acid sequence described herein from about 
75% to 100%, within the signature sequence. The amino acid sequence similarity 
between the signature sequences of orthologous polypeptides is preferably 80%, more 
preferably 90%, and still more preferably, 95%. The ortholog polynucleotides encode 

2 0 polypeptides that have similar functional characteristics (e.g., fatty acid transport 

activity) and similar tissue distribution, as appropriate to the organism from which the 
ortholog polynucleotides can be isolated. 

Ortholog polynucleotides can be isolated from (e.g., by cloning or nucleic acid 
amplification methods) a great number of species, as shown by the sample of FATPs 

2 5 from evolutionarily divergent species described herein (see, e.g., Figures 44A-C through 
Figure 89). Ortholog polynucleotides corresponding to those in Figure 45 (SEQ ED 
NO:47), Figure 47 (SEQ ID NO:49), Figures 94A and 94B (SEQ ID NO:101), Figure 
51 (SEQ ID NO:53), Figure 52 (SEQ ID NO:54) and Figure 55 (SEQ ID NO:57) are 



WHI9721p3MC2 



-25- 

those which can be isolated from mammals such as rat, dog, chimpanzee, monkey, 
baboon, pig, rabbit and guinea pig, for example. 

Further variants that are fragments of the nucleic acids of the invention may be 
used to synthesize full-length nucleic acids of the invention, such as by use as primers in 
5 a polymerase chain reaction. As used herein, the term primer refers to a single-stranded 
oligonucleotide which acts as a point of initiation of template-directed DNA synthesis 
under appropriate conditions (e.g., in the presence of four different nucleoside 
triphosphates and an agent for polymerization, such as DNA or RNA polymerase or 
reverse transcriptase) in an appropriate buffer and at a suitable temperature. The 

1 0 appropriate length of a primer depends on the intended use of the primer, but typically 
ranges from 15 to 30 nucleotides. Short primer molecules generally require cooler 
temperatures to form sufficiently stable hybrid complexes with the template. A primer 
need not reflect the exact sequence of the template, but must be sufficiently 
complementary to hybridize with a template. The term primer site refers to the area of 

1 5 the target DNA to which a primer hybridizes. The term primer pair refers to a set of 
primers including a 5 ! (upstream) primer that hybridizes with the 5 ! end of the DNA 
sequence to be amplified and a 3' (downstream) primer that hybridizes with the 
complement of the 3 1 end of the sequence to be amplified. 

Further embodiments of the invention are nucleic acids that are at least 80% 

2 0 identical over their entire length to a nucleic acid described herein, for example a 
nucleic acid having the nucleotide sequence in Figures 44A-44C (SEQ ID NO:46), 
Figures 46A-46B (SEQ ID NO:48), Figures 94A and 94B (SEQ ID NO: 101), Figures 
50A-50C (SEQ ID NO:52), Figure 52 (SEQ ID NO:54), and Figures 54A-54C (SEQ ID 
NO:56). Additional embodiments are nucleic acids, and the complements of such 

2 5 nucleic acids, having at least 90% nucleotide sequence identity to the above-described 
sequences, and nucleic acids having at least 95% nucleotide sequence identity. In 
preferred embodiments, DNA of the present invention has 97% nucleotide sequence 
identity, 98% nucleotide sequence identity, or at least 99% nucleotide sequence identity 
with the DNA whose sequences are presented herein. 
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Other embodiments of the invention are nucleic acids that are at least 80% 
identical in nucleotide sequence to a nucleic acid encoding a polypeptide having an 
amino acid sequence as set forth in Figure 45 (SEQ ID NO:47), Figure 47 (SEQ ID 
NO:49), Figures 94A and 94B (SEQ ID NO:102), Figure 51 (SEQ ID NO:53), Figure 
5 53 (SEQ ID NO:55) or Figure 55 (SEQ ID NO:57), or as such amino acid sequences are 
set forth elsewhere herein, and nucleic acids that are complementary to such nucleic 
acids. Specific embodiments are nucleic acids having at least 90% nucleotide sequence 
identity to a nucleic acid encoding a polypeptide having an amino acid sequence as 
described in the list above, nucleic acids having at least 95% sequence identity, and 

1 0 nucleic acids having at least 97% sequence identity. 

The terms "complementary" or "complementarity" as used herein, refer to the 
natural binding of polynucleotides under permissive salt and temperature conditions by 
base-pairing. Complementarity between two single-stranded molecules may be 
"partial" in which only some of the nucleic acids bind, or it may be complete when total 

1 5 complementarity exists between the single-stranded molecules (that is, when A-T and 
G-C base pairing is 100% complete). The degree of complementarity between nucleic 
acid strands has significant effects on the efficiency and strength of hybridization 
between nucleic acid strands. This is of particular importance in amplification 
reactions, which depend on binding between nucleic acid strands. 

2 0 The invention further includes nucleic acids that hybridize to the above- 

described nucleic acids, especially those nucleic acids that hybridize under stringent 
hybridization conditions. "Stringent hybridization conditions" or "high stringency 
conditions" generally occur within a range from about T m minus 5°C (5° C below the 
strand dissociation temperature or melting temperature (T m ) of the probe nucleic acid 

2 5 molecule) to about 20° C to 25° C below T m . As will be understood by those of skill in 
the art, the stringency of hybridization may be altered in order to identify or detect 
molecules having identical or related polynucleotide sequences. An example of high 
stringency hybridization follows. Hybridization solution is (6x SSC/10 mM 
EDTA/0.5% SDS/5x Denhardt's solution/100 |lg/ml sheared and denatured salmon 
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sperm DNA). Hybridization is at 64-65°C for 16 hours. The hybridized blot is washed 
two times with 2x SSC/0.5% SDS solution at room temperature for 15 minutes each, 
and two times with 0.2x SSC/0.5% SDS at 65°C, for one hour each. Further examples 
of high stringency conditions can be found on pages 2.10.1-2.10.16 (see particularly 
5 2.10.8-11) and pages 6.3.1-6 in Current Protocols in Molecular Biology (Ausubel, F.M. 
et al 9 eds., containing supplements up through Supplement 42, 1998). Examples of 
high, medium, and low stringency conditions can be found on pages 36 and 37 of WO 
98/40404, which are incorporated herein by reference. 

The invention further relates to nucleic acids obtainable by screening an 

1 0 appropriate library with a probe having a nucleotide sequence such as that set forth in 
Figures 44A-44C (SEQ ID NO:46), Figures 46A-46B (SEQ ID NO:48), Figures 94A 
and 94B (SEQ ID NO: 101), Figures 50A-50C (SEQ ID NO:52), Figure 52 (SEQ ID 
NO:54) or Figures 54A-54C (SEQ ID NO:56), or a probe which is a sufficiently long 
fragment of any of the above; and isolating the nucleic acid. Such probes generally can 

15 comprise at least 15 nucleotides. Nucleic acids obtainable by such screenings may 
include RNAs, cDNAs and genomic DNA, for example, encoding FATPs of the FATP 
family described herein. 

Further uses for the nucleic acid molecules of the invention, whether encoding a 
full-length FATP or whether comprising a contiguous portion of a nucleic acid molecule 

2 0 such as one given in SEQ ID NO:46, 48, 101, 52, 54, or 56, include use as markers for 
tissues in which the corresponding protein is preferentially expressed (to identify 
constitutively expressed proteins or proteins produced at a particular stage of tissue 
differentiation or stage of development of a disease state); as molecular weight markers 
on southern gels; as chromosome markers or tags (when labeled, for example with 

2 5 biotin, a radioactive label or a fluorescent label) to identify chromosomes or to map 
related gene positions; to compare with endogenous DNA sequences in a mammal to 
identify potential genetic disorders; as probes to hybridize and thus identify, related 
DNA sequences; as a source of information to derive PCR primers for genetic 
fingerprinting; as a probe to "subtract-out" known sequences in the process of 
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discovering other novel nucleic acid molecules; for selecting and making oligomers for 
attachment to a "gene chip" or other support, to be used, for example, for examination 
of expression patterns; to raise anti-protein antibodies using DNA immunization 
techniques; and as an antigen to raise anti-DNA antibodies or to elicit another immune 
5 response. 

Further methods to obtain nucleic acids encoding FATPs of the FATP family 
include PCR and variations thereof (e.g., "RACE" PCR and semi-specific PCR 
methods). Portions of the nucleic acids having a nucleotide sequence set forth in 
Figures 44A-44C (SEQ ID NO:46), Figures 46A-46B (SEQ ID NO:48), Figures 94A 

10 and 94B (SEQ ED NO: 101), Figures 50A-50C (SEQ ID NO:52), Figure 52 (SEQ ID 
NO:54) or Figures 54A-54C (SEQ ID NO:56), (especially "flanking sequences" on 
either side of a coding region) can be used as primers in methods using the polymerase 
chain reaction, to produce DNA from an appropriate template nucleic acid. 

Once a fragment of the FATP gene is generated by PCR, it can be sequenced, 

1 5 and the sequence of the product can be compared to other DNA sequences, for example, 
by using the BLAST Network Service at the National Center for Biotechnology 
Information. The boundaries of the open reading frame can then be identified using 
semi-specific PCR or other suitable methods such as library screening. Once the 5' 
initiator methionine codon and the 3' stop codon have been identified, a PCR product 

2 0 encoding the full-length gene can be generated using genomic DNA as a template, with 
primers complementary to the extreme 5' and 3' ends of the gene or to their flanking 
sequences. The full-length genes can then be cloned into expression vectors for the 
production of functional proteins. 

The invention also relates to isolated proteins or polypeptides such as those 

2 5 encoded by nucleic acids of the present invention. Isolated proteins can be purified 
from a natural source or can be made recombinantly. Proteins or polypeptides referred 
to herein as "isolated" are proteins or polypeptides that exist in a state different from the 
state in which they exist in cells in which they are normally expressed in an organism, 
and include proteins or polypeptides obtained by methods described herein, similar 
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state in which they exist in cells in which they are normally expressed in an organism, 
and include proteins or polypeptides obtained by methods described herein, similar 
methods or other suitable methods, and also include essentially pure proteins or 
polypeptides, proteins or polypeptides produced by chemical synthesis or by 
5 combinations of biological and chemical methods, and recombinant proteins or 

polypeptides which are isolated. Thus, the term "isolated" as used herein, indicates that 
the polypeptide in question exists in a physical milieu distinct from that in which it 
occurs in nature. Thus, "isolated" includes existing in membrane fragments and vesicles 
membrane fractions, liposomes, lipid bilayers and other artificial membrane systems. 

10 An isolated FATP may be substantially isolated with respect to the complex cellular 
milieu in which it naturally occurs, and may even be purified essentially to 
homogeneity, for example as determined by PAGE or column chromatography (for 
example, HPLC), but may also have further cofactors or molecular stabilizers, such as 
detergents, added to the purified protein to enhance activity. In one embodiment, 

1 5 proteins or polypeptides are isolated to a state at least about 75% pure; more preferably 
at least about 85% pure, and still more preferably at least about 95% pure, as determined 
by Coomassie blue staining of proteins on SDS-polyacrylamide gels. Proteins or 
polypeptides referred to herein as "recombinant" are proteins or polypeptides produced 
by the expression of recombinant nucleic acids. 

2 0 In a preferred embodiment, an isolated polypeptide comprising a FATP, a 

functional portion thereof, or a functional equivalent of the FATP, has at least one 
function characteristic of a FATP, for example, transport activity, binding function (e.g., 
a domain which binds to AMP), or antigenic function (e.g., binding of antibodies that 
also bind to a naturally-occurring FATP, as that function is found in an antigenic 

2 5 determinant). Functional equivalents can have activities that are quantitatively similar 
to, greater than, or less than, the reference protein. These proteins include, for example, 
naturally occurring FATPs that can be purified from tissues in which they are produced 
(including polymorphic or allelic variants), variants (e.g., mutants) of those proteins 
and/or portions thereof. Such variants include mutants differing by the addition, 
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deletion or substitution of one or more amino acid residues, or modified polypeptides in 
which one or more residues are modified, and mutants comprising one or more modified 
residues. Portions or fragments of a FATP can range in size from four amino acid 
residues to the entire amino acid sequence minus one amino acid. 

The isolated proteins of the invention preferably include mammalian fatty acid 
transport proteins of the FATP family of homologous proteins. In one embodiment, the 
extent of amino acid sequence similarity between a polypeptide having one of the amino 
acid sequences shown in Figure 45 (SEQ ID NO:47), Figure 47 (SEQ ID NO:49), 
Figures 94A and 94B (SEQ ID NO: 102), Figure 51 (SEQ ID NO:53), Figure 53 (SEQ 
ID NO:55), or Figure 55 (SEQ ID NO:57), and the respective functional equivalents of 
these polypeptides is at least about 88%. In other embodiments, the degree of amino 
acid sequence similarity between a FATP and its respective functional equivalent is at 
least about 91%, at least about 94%, or at least about 97%. 

The polypeptides of the invention also include those FATPs encoded by 
polynucleotides which are orthologous to those polynucleotides, the sequences of which 
are described herein in whole or in part. FATPs which are orthologs to those described 
herein by amino acid sequence, in whole or in part, are, for example fatty acid transport 
proteins 1-6 of dog, rat chimpanzee, monkey, rabbit, guinea pig, baboon and pig, and 
are also embodiments of the invention. 

To determine the percent identity or similarity of two amino acid sequences or 
of two nucleic acid sequences, the sequences are aligned for optimal comparison 
purposes (e.g., gaps can be introduced in one or both of a first and a second amino acid 
or nucleic acid sequence for optimal alignment, and non-homologous (dissimilar) 
sequences can be disregarded for comparison purposes). In a preferred embodiment, the 
length of a reference sequence aligned for comparison purposes is at least 30%, 
preferably at least 40%, more preferably at least 50%, even more preferably at least 
60%, and even more preferably at least 70%, 80%, or 90% of the length of the reference 
sequence. The amino acid residues or nucleotides at corresponding amino acid 
positions or nucleotide positions are then compared. When a position in the first 
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sequence is occupied by the same amino acid residue or nucleotide as the corresponding 
position in the second sequence, then the molecules are identical at that position (as 
used herein, amino acid or nucleic acid "identity" is equivalent to amino acid or nucleic 
acid "similarity 1 '). The percent identity between the two sequences is a function of the 
5 number of identical positions shared by the sequences, taking into account the number 
of gaps, and the length of each gap, which need to be introduced for optimal alignment 
of the two sequences. 

The invention also encompasses polypeptides having a lower degree of identity 
but having sufficient similarity so as to perform one or more of the same functions 

1 0 performed by the polypeptides described herein by amino acid sequence. Similarity for 
a polypeptide is determined by conserved amino acid substitution. Such substitutions 
are those that substitute a given amino acid in a polypeptide by another amino acid of 
like characteristics. Conservative substitutions are likely to be phenotypically silent. 
Typically seen as conservative substitutions are the replacements, one for another, 

1 5 among the aliphatic amino acids Ala, Val, Leu, and He; interchange of the hydroxyl 

residues Ser and Thr, exchange of the acidic residues Asp and Glu, substitution between 
the amide residues Asn and Gin, exchange of the basic residues Lys and Arg and 
replacements among the aromatic residues Phe, Tyr. Guidance concerning which amino 
acid changes are likely to be phenotypically silent is found in Bowie et aL, Science 

2 0 247:1306-1310(1990). 
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TABLE 1 . Conservative Amino Acid Substitutions 



Aromatic 




Phenylalanine 








Tryptophan 








Tyrosine 




Hydrophobic 




Leucine 








Isoleucine 








Valine 




Polar 




Glutamine 








Asparagme 




Basic 




Arginine 








Lysine 








Histidine 




Acidic 




Aspartic Acid 








Glutamic Acid 




Small 




Alanine 








Serine 








Threonine 








Methionine 








Glycine 





The comparison of sequences and determination of percent identity and 
similarity between two sequences can be accomplished using a mathematical algorithm. 
1 0 (Computational Molecular Biology, Lesk, A.M.,ed., Oxford University Press, New 
York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., 
Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part 1, 
Griffin, A.M., and Griffin, H.G., eds., Humana Press, New Jersey, 1994; Sequence 
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Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and Sequence 
Analysis Primer, Gribskov, M. and Devereaux, J., eds., M. Stockton 
Press, New York, 1991). In a preferred embodiment, the percent identity between two 
amino acid sequences is determined using the Needleman and Wunsch (J. Mol Biol 
5 (48):444-453 (1970)) algorithm which has been incorporated into the GAP program in 
the GCG software package (available at http://www.gcg.com), using either a Blossom 
62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and a 
length weight of 1, 2, 3, 4, 5, or 6. In yet another preferred embodiment, the percent 
identity between two nucleotide sequences is determined using the GAP program in the 

1 0 GCG software package (Devereux, J., et al , Nucleic Acids Res, 12(1):387 (1984)) 

(available at http://www.gcg.com), using a NWSgapdna.CMP matrix and a gap weight 
of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. In another 
embodiment, the percent identity between two amino acid or nucleotide sequences is 
determined using the algorithm of E. Meyers and W. Miller (CABIOS, 4:11-11 (1989)) 

1 5 which has been incorporated into the ALIGN program (version 2.0), using a PAM120 
weight residue table, a gap length penalty of 12 and a gap penalty of 4. 

The nucleic acid and protein sequences of the present invention can further be 
used as a "query sequence" to perform a search against databases to, for example, 
identify other family members or related sequences. Such searches can be performed 

2 0 using the NBLAST and XBLAST programs (version 2.0) of Altschul, et al (J. Mol 
Biol 275:403-10 (1990)). BLAST nucleotide searches can be performed with the 
NBLAST program, score = 100, word length = 12 to obtain nucleotide sequences 
homologous to (with calculatably significant similarity to) the nucleic acid molecules of 
the invention. BLAST protein searches can be performed with the XBLAST program, 

2 5 score = 50, word length = 3 to obtain amino acid sequences homologous to the proteins 
of the invention. To obtain gapped alignments for comparison purposes, Gapped 
BLAST can be utilized as described in Altschul et al, (Nucleic Acids Res. 25(17):3389- 
3402 (1997)). When utilizing BLAST and gapped BLAST programs, the default 
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parameters of the respective programs (e.g., XBLAST and NBLAST) can be used. See 
http://www.ncbi.nlm.nih.gov. 

Similarity for nucleotide and amino acid sequences can be defined in terms of 
the parameters set by the Advanced Blast search available from NCBI (the National 
5 Center for Biotechnology Information; see, for Advanced BLAST page, 

www.ncbi.nlm.nih.gov/cgi-bin/BLAST/nph-newblast?Jform==l). These default 
parameters, recommended for a query molecule of length greater than 85 amino acid 
residues or nucleotides have been set as follows: gap existence cost, 11, per residue gap 
cost, 1; lambda ratio, 0.85. Further explanation of version 2.0 of BLAST can be found 
10 on related website pages and in Altschul, S.F. et ah, Nucleic Acids Res. 25:3389-3402 
(1997). 

The invention further relates to fusion proteins, comprising a FATP or 
functional portion thereof (as described above) as a first moiety, linked to second 
moiety not occurring in the FATP as found in nature. Thus, the second moiety can be 

15 an amino acid, peptide or polypeptide. The first moiety can be in an N-terminal 

location, C-terminal location or internal to the fusion protein. In one embodiment, the 
fusion protein comprises a FATP as the first moiety, and a second moiety comprising a 
linker sequence and an affinity ligand. Fusion proteins can be produced by a variety of 
methods. For example, a fusion protein can be produced by the insertion of a FATP 

2 0 gene or portion thereof into a suitable expression vector, such as Bluescript SK +/- 
(Stratagene), pGEX-4T-2 (Pharmacia), pET-24(+) (Novagen), or vectors of similar 
construction. The resulting construct can be introduced into a suitable host cell for 
expression. Upon expression, fusion protein can be purified from cells by means of a 
suitable affinity matrix (See e.g., Current Protocols in Molecular Biology, Ausubel, 

25 F.M. et ah, eds., Vol. 2, pp. 16.4.1-16.7.8, containing supplements up through 
Supplement 42, 1998). 

The invention also relates to enzymatically produced, synthetically produced, or 
recombinantly produced portions of a fatty acid transport protein. Portions of a FATP 
can be made which have full or partial function on their own, or which when mixed 
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together (though fully, partially, or nonfunctional alone), spontaneously assemble with 
one or more other polypeptides to reconstitute a functional protein having at least one 
function characteristic of a FATP. 

Fragments of a FATP can be produced by direct peptide synthesis, for example 
5 those using solid-phase techniques (Roberge, J.Y. et al, Science 269:202-204 (1995); 
Merrifield, J., J. Am. Chem. Soc. 55:2149-2154 (1963)). Protein synthesis can be 
performed using manual techniques or by automation. Automated synthesis can be 
carried out using, for instance, an Applied Biosystems 431 A Peptide Synthesizer 
(Perkin Elmer). Various fragments of a FATP can be synthesized separately and 

1 0 combined using chemical methods. 

One aspect of the invention is a peptide or polypeptide having the amino acid 
sequence of a portion of a fatty acid transport protein which is hydrophilic rather than 
hydrophobic, and ordinarily can be detected as facing the outside of the cell membrane. 
Such a peptide or polypeptide can be thought of as being an extracellular domain of the 

1 5 FATP, or a mimetic of said extracellular domain. It is known, for example, that a 
portion of human FATP4 that includes a highly conserved motif is involved in AMP- 
CoA binding function (Stuhlsatz-Krouper, S.M. et al, J. Biol Chem. 44:28642-28650 
(1998)). 

The term "mimetic" as used herein, refers to a molecule, the structure of which 
20 is developed from knowledge of the structure of the FATP of interest, or one or more 
portions thereof, and, as such, is able to effect some or all of the functions of a FATP. 

Portions of an FATP can be prepared by enzymatic cleavage of the isolated 
protein, or can be made by chemical synthesis methods. Portions of a FATP can also be 
made by recombinant DNA methods in which restriction fragments, or fragments that 
2 5 may have undergone further enzymatic processing, or synthetically made DNAs are 
joined together to construct an altered FATP gene. The gene can be made such that it 
encodes one or more desired portions of a FATP. These portions of FATP can be 
entirely homologous to a known FATP, or can be altered in amino acid sequence 
relative to naturally occurring FATPs to enhance or introduce desired properties such as 



WHI9721p3MC2 



-36- 

solubility, stability, or affinity to a ligand. A further feature of the gene can be a 
sequence encoding an N-terminal signal peptide directed to the plasma membrane. 

An extracellular domain can be determined by a hydrophobicity plot, such as 
those shown in Figures 28A, 29A, and 35A, or by a hydrophilicity plot such as those 
5 shown in Figures 28C, 29C, 35C, 91, 92 and 93. A polypeptide or peptide comprising 
all or a portion of a FATP extracellular domain can be used in a pharmaceutical 
composition. When administered to a mammal by an appropriate route, the polypeptide 
or peptide can bind to fatty acids and compete with the native FATPs in the membrane 
of cells, thereby making fewer fatty acid molecules available as substrates for transport 

1 0 into cells, and reducing the amount of fatty acids taken up by, for example, the heart, in 
the case of FATP6. 

Another aspect of the invention relates to a method of producing a fatty acid 
transport protein, variants or portions thereof, and to expression systems and host cells 
containing a vector appropriate for expression of a fatty acid transport protein. 

1 5 Cells that express a FATP, a variant or a portion thereof, or an ortholog of a 

FATP described herein by amino acid sequence, can be made and maintained in culture, 
under conditions suitable for expression, to produce protein in the cells for cell-based 
assays, or to produce protein for isolation. These cells can be procaryotic or eucaryotic. 
Examples of procaryotic cells that can be used for expression include Escherichia coli, 

2 0 Bacillus subtilis and other bacteria. Examples of eucaryotic cells that can be used for 
expression include yeasts such as Saccharomyces cerevisiae, Schizosaccharomyces 
pombe, Pichia pastoris and other lower eucaryotic cells, and cells of higher eucaryotes 
such as those from insects and mammals, such as primary cells and cell lines such as 
CHO, HeLa, 3T3 and BHK cells, preferably COS cells and human kidney 293 cells, and 

2 5 more preferably Jurkat cells. (See, e.g., Ausubel, F.M. et al. f eds. Current Protocols in 
Molecular Biology, Greene Publishing Associates and John Wiley & Sons, Inc., 
containing Supplements up through Supplement 42, 1998)). 

In one embodiment, host cells that produce a recombinant FATP, or a portion 
thereof, a variant, or an ortholog of a FATP described herein by amino acid sequence, 
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can be made as follows. A gene encoding a FATP, variant or a portion thereof can be 
inserted into a nucleic acid vector, e.g., a DNA vector, such as a plasmid, phage, 
cosmid, phagemid, virus, virus-derived vector (e.g., SV40, vaccinia, adenovirus, fowl 
pox virus, pseudorabies viruses, retroviruses) or other suitable replicon, which can be 
5 present in a single copy or multiple copies, or the gene can be integrated in a host cell 
chromosome. A suitable replicon or integrated gene can contain all or part of the 
coding sequence for a FATP or variant, operably linked to one or more expression 
control regions whereby the coding sequence is under the control of transcription 
signals and linked to appropriate translation signals to permit translation. The vector 

1 0 can be introduced into cells by a method appropriate to the type of host cells (e.g., 
transfection, electroporation, infection). For expression from the FATP gene, the host 
cells can be maintained under appropriate conditions (e.g., in the presence of inducer, 
normal growth conditions, etc.). Proteins or polypeptides thus produced can be 
recovered (e.g., from the cells, as in a membrane fraction, from the periplasmic space of 

1 5 bacteria, from culture medium) using suitable techniques. Appropriate membrane 
targeting signals may be incorporated into the expressed polypeptide. These signals 
may be endogenous to the polypeptide or they may be heterologous signals. 

Polypeptides of the invention can be recovered and purified from cell cultures 
(or from their primary cell source) by well-known methods including ammonium sulfate 

2 0 or ethanol precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and high performance liquid 
chromatography. Known methods for refolding protein can be used to regenerate active 
conformation if the polypeptide is denatured during isolation or purification. 

2 5 In a further aspect of the invention are methods for assessing the transport 

function of any of the fatty acid transport proteins or polypeptides described herein, 
including orthologs, and in variations of these, methods for identifying an inhibitor (or 
an enhancer) of such function and methods for assessing the transport function in the 
presence of a candidate inhibitor or a known inhibitor. 
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A variety of systems comprising living cells can be used for these methods. 
Cells to be used in fatty acid transport assays, and further in methods for identifying an 
inhibitor or enhancer of this function, express one or more FATPs. See Examples 3, 6, 
9, 12 and 14 for data on tissue distribution of expression of FATPs, and Examples 10 
5 and 1 1 describing recombinant cells expressing FATP. Cells for use in cell-based 

assays described herein can be drawn from a variety of sources, such as isolated primary 
cells of various organs and tissues wherein one or more FATPs are naturally expressed. 
In some cases, the cells can be from adult organs, and in some cases, from embryonic or 
fetal organs, such as heart, lung, liver, intestine, skeletal muscle, kidney and the like. 
1 0 Cells for this purpose can also include cells cultured as fragments of organs or in 
□ conditions simulating the cell type and/or tissue organization of organs, in which 

"% artificial materials may be used as substrates for cell growth. Other types of cells 

G suitable for this purpose include cells of a cell strain or cell line (ordinarily comprising 

Hi cells considered to be "transformed") transfected to express one or more FATPs. 

% 15 A further embodiment of the invention is a method for detecting, in a sample of 

3 cells, a fatty acid transport protein, a portion or fragment thereof, a fusion protein 

=5 comprising a FATP or a portion thereof, or an ortholog as described herein, wherein the 

! u z cells can be, for instance, cells of a tissue, primary culture cells, or cells of a cell line, 

Jl including cells into which nucleic acid has been introduced. The method comprises 

2 0 adding to the sample an agent that specifically binds to the protein, and detecting the 
agent specifically bound to the protein. Appropriate washing steps can be added to 
reduce nonspecific binding to the agent. The agent can be, for example, an antibody, a 
ligand or a substrate mimic. The agent can have incorporated into it, or have bound to 
it, covalently or by high affinity non-covalent interactions, for instance, a label that 
2 5 facilitates detection of the agent to which it is bound, wherein the label can be, but is not 
limited to, a phosphorescent label, a fluorescent label, a biotin or avidin label, or a 
radioactive label. The means of detection of a fatty acid transport protein can vary, as 
appropriate to the agent and label used. For example, for an antibody that binds to the 
fatty acid transport protein, the means of detection may call for binding a second 



WHI9721p3MC2 



-39- 

antibody, which has been conjugated to an enzyme, to the antibody which binds the 
fatty acid transport protein, and detecting the presence of the second antibody by means 
of the enzymatic activity of the conjugated enzyme. 

Similar principles can also be applied to a cell lysate or a more purified 
5 preparation of proteins from cells that may comprise a fatty acid transport protein of 
interest, for example in the methods of immunoprecipitation, immunoblotting, 
immunoaffmity methods, that in addition to detection of the particular FATP, can also 
be used in purification steps, and qualitative and quantitative immunoassays. See, for 
instance, chapters 1 1 through 14 in Antibodies: A Laboratory Manual, E. Harlow and 

10 D. Lane, eds., Cold Spring Harbor Laboratory, 1988. 

Isolated fatty acid transport protein or, an antigenically similar portion thereof, 
especially a portion that is soluble, can be used in a method to select and identify 
molecules which bind specifically to the FATP. Fusion proteins comprising all of, or a 
portion of, the fatty acid transport protein linked to a second moiety not occurring in the 

1 5 FATP as found in nature, can be prepared for use in another embodiment of the method. 
Suitable fusion proteins for this purpose include those in which the second moiety 
comprises an affinity ligand (e.g., an enzyme, antigen, epitope). FATP fusion proteins 
can be produced by the insertion of a gene encoding the FATP or a variant thereof, or a 
suitable portion of such gene into a suitable expression vector, which encodes an 

2 0 affinity ligand (e.g., pGEX-4T-2 and pET-15b, encoding glutathione S-transferase and 
His-Tag affinity ligands, respectively). The expression vector can be introduced into a 
suitable host cell for expression. Host cells are lysed and the lysate, containing fusion 
protein, can be bound to a suitable affinity matrix by contacting the lysate with an 
affinity matrix. 

2 5 In one embodiment, the fusion protein can be immobilized on a suitable affinity 

matrix under conditions sufficient to bind the affinity ligand portion of the fusion 
protein to the matrix, and is contacted with one or more candidate binding agents (e.g., a 
mixture of peptides) to be tested, under conditions suitable for binding of the binding 
agents to the FATP portion of the bound fusion protein. Next, the affinity matrix with 
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bound fusion protein can be washed with a suitable wash buffer to remove unbound 
candidate binding agents and non-specifically bound candidate binding agents. Those 
agents which remain bound can be released by contacting the affinity matrix with fusion 
protein bound thereto with a suitable elution buffer. Wash buffer can be formulated to 
5 permit binding of the fusion protein to the affinity matrix, without significantly 

disrupting binding of specifically bound binding agents. In this aspect, elution buffer 
can be formulated to permit retention of the fusion protein by the affinity matrix, but 
can be formulated to interfere with binding of the candidate binding agents to the target 
portion of the fusion protein. For example, a change in the ionic strength or pH of the 

1 0 elution buffer can lead to release of specifically bound agent, or the elution buffer can 
comprise a release component or components designed to disrupt binding of specifically 
bound agent to the target portion of the fusion protein. 

Immobilization can be performed prior to, simultaneous with, or after, 
contacting the fusion protein with candidate binding agent, as appropriate. Various 

1 5 permutations of the method are possible, depending upon factors such as the candidate 
molecules tested, the affinity matrix-ligand pair selected, and elution buffer formulation. 
For example, after the wash step, fusion protein with binding agent molecules bound 
thereto can be eluted from the affinity matrix with a suitable elution buffer (a matrix 
elution buffer, such as glutathione for a GST fusion). Where the fusion protein 

2 0 comprises a cleavable linker, such as a thrombin cleavage site, cleavage from the 
affinity ligand can release a portion of the fusion with the candidate agent bound 
thereto. Bound agent molecules can then be released from the fusion protein or its 
cleavage product by an appropriate method, such as extraction. 

One or more candidate binding agents can be tested simultaneously. Where a 

2 5 mixture of candidate binding agents is tested, those found to bind by the foregoing 
processes can be separated (as appropriate) and identified by suitable methods (e.g., 
PCR, sequencing, chromatography). Large libraries of candidate binding agents (e.g., 
peptides, RNA oligonucleotides) produced by combinatorial chemical synthesis or by 
other methods can be tested (see e.g., Ohlmeyer, M.H.J, et ah, Proc. Natl. Acad Sci. 
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USA 90: 10922-1 0926 (1993) and DeWitt, S.H. et al 9 Proc. Natl Acad. Sci USA 
90:6909-6913 (1993), relating to tagged compounds; see also Rutter, WJ. et al U.S. 
Patent No. 5,010,175; Huebner, V.D. et al, U.S. Patent No. 5,182,366; and Geysen, 
H.M., U.S. Patent No. 4,833,092). Random sequence RNA libraries (see Ellington, 
5 A.D. et al, Nature 345:818-822 (1990); Bock, L.C. et al, Nature 355:584-566 (1992); 
and Szostak, J.W., Trends in Biochem, Sci 77:89-93 (March, 1992)) can also be 
screened according to the present method to select RNA molecules which bind to a 
target FATP or FATP fusion protein. Where binding agents selected from a 
combinatorial library by the present method carry unique tags, identification of 
1 0 individual biomolecules by chromatographic methods is possible. Where binding 

agents do not carry tags, chromatographic separation, followed by mass spectrometry to 
ascertain structure, can be used to identify binding agents selected by the method, for 
example. 

The invention also comprises a method for identifying an agent which inhibits 
15 interaction between a fatty acid transport protein (e.g., one comprising the amino acid 
sequence in SEQ ID NO:47, SEQ ID NO:49, SEQ ID NO:51, SEQ ID NO:53, SEQ ID 
NO: 102, or SEQ ID NO:57), and a ligand of said protein. The FATP can be one 
described by amino acid sequence herein, a portion or fragment thereof, a variant 
thereof, or an ortholog thereof, or a FATP fusion protein. Here, a ligand can be, for 
2 0 instance, a substrate, or a substrate mimic, an antibody, or a compound, such as a 
peptide, that binds with specificity to a site on the protein. The method comprises 
combining, not limited to a particular order, the fatty acid protein, the ligand of the 
protein, and a candidate agent to be assessed for its ability to inhibit interaction between 
the protein and the ligand, under conditions appropriate for interaction between the 
2 5 protein and the ligand (e.g., pH, salt, temperature conditions conducive to appropriate 
conformation and molecular interactions); determining the extent to which the protein 
and ligand interact; and comparing (1) the extent of protein-ligand interaction in the 
presence of candidate agent with (2) the extent of protein-ligand interaction in the 
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absence of candidate agent, wherein if (1) is less than (2), then the candidate agent is 
one which inhibits interaction between the protein and the ligand. 

The method can be facilitated, for example, by using an experimental system 
which employs a solid support (column chromatography matrix, wall of a plate, 
5 microtiter wells, column pore glass, pins to be submerged in a solution, beads, etc.) to 
which the protein can be attached. Accordingly, in one embodiment, the protein can be 
fixed to a solid phase directly or indirectly, by a linker. The candidate agent to be tested 
is added under conditions conducive for interaction and binding to the protein. The 
ligand is added to the solid phase system under conditions appropriate for binding. 

1 0 Excess ligand is removed, as by a series of washes done under conditions that do not 
disrupt protein-ligand interactions. Detection of bound ligand can be facilitated by 
using a ligand that carries a label (e.g., fluorescent, chemiluminescent, radioactive). In a 
control experiment, protein and ligand are allowed to interact in the absence of any 
candidate agent, under conditions otherwise identical to those used for the "test" 

1 5 conditions where candidate inhibiting agent is present, and any washes used in the test 
conditions are also used in the control. The extent to which ligand binds to the protein 
in the presence of candidate agent is compared to the extent to which ligand binds to the 
protein in the absence of the candidate agent. If the extent to which interaction of the 
protein and the ligand occurs is less in the presence of the candidate agent than in the 

2 0 absence of the candidate agent, the candidate agent is an agent which inhibits interaction 
between the protein and the ligand of the protein. 

In a further embodiment, an inhibitor (or an enhancer) of a fatty acid transport 
protein can be identified. The method comprises steps which are, or are variations of 
the following: contacting the cells with fatty acid, wherein the fatty acid can be labeled 

2 5 for convenience of detection; contacting a first aliquot of the cells with an agent being 
tested as an inhibitor (or enhancer) of fatty acid uptake while maintaining a second 
aliquot of cells under the same conditions but without contact with the agent; and 
measuring (e.g., quantitating) fatty acid in the first and second aliquots of cells; wherein 
a lesser quantity of fatty acid in the first aliquot compared to that in the second aliquot 
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is indicative that the agent is an inhibitor of fatty acid uptake by a fatty acid transport 
protein. A greater quantity of fatty acid in the first aliquot compared to that in the 
second aliquot is indicative that the agent is an enhancer of fatty acid uptake by a fatty 
acid transport protein. 

5 A particular embodiment of identifying an inhibitor or enhancer of fatty acid 

transport function employs the above steps, but also employs additional steps preceding 
those given above: introducing into cells of a cell strain or cell line ("host cells" for the 
intended introduction of, or after the introduction of, a vector) a vector comprising a 
fatty acid transport protein gene, wherein expression of the gene can be regulatable or 
1 0 constitutive, and providing conditions to the host cells under which expression of the 
gene can occur. 

The terms "contacting" and "combining" as used herein in the context of 
bringing molecules into close proximity to each other, can be accomplished by 
conventional means. For example, when referring to molecules that are soluble, 
1 5 contacting is achieved by adding the molecules together in a solution. "Contacting" can 
also be adding an agent to a test system, such as a vessel containing cells in tissue 
culture. 

The term "inhibitor" or "antagonist", as used herein, refers to an agent which 
blocks, diminishes, inhibits, hinders, limits, decreases, reduces, restricts or interferes 

2 0 with fatty acid transport into the cytoplasm of a cell, or alternatively and additionally, 
prevents or impedes the cellular effects associated with fatty acid transport. The term 
"enhancer" or "agonist", as used herein, refers to an agent which augments, enhances, or 
increases fatty acid transport into the cytoplasm of a cell. An antagonist will decrease 
fatty acid concentration, fatty acid metabolism and byproduct levels in the cell, leading 

25 to phenotypic and molecular changes. 

In order to produce a "host cell" type suitable for fatty acid uptake assays and for 
assays derived therefrom for identifying inhibitors or enhancers thereof, a nucleic acid 
vector can be constructed to comprise a gene encoding a fatty acid transport protein, for 
example, human FATP1, FATP2, FATP3, FATP4, FATP5, FATP6, a mutant or variant 
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thereof, an ortholog of the human proteins, such as mouse orthologs or orthologs found 
in other mammals, or a FATP family protein of origin in an organism other than a 
mammal. The gene of the vector can be regulatable, such as by the placement of the 
gene under the control of an inducible or repressible promoter in the vector (e.g., 
5 inducible or repressible by a change in growth conditions of the host cell harboring the 
vector, such as addition of inducer, binding or functional removal of repressor from the 
cell millieu, or change in temperature) such that expression of the FATP gene can be 
turned on or initiated by causing a change in growth conditions, thereby causing the 
protein encoded by the gene to be produced, in host cells comprising the vector, as a 
1 0 plasma membrane protein. Alternatively, the FATP gene can be constitutively 
expressed. 

A vector comprising an FATP gene, such as a vector described herein, can be 
introduced into host cells by a means appropriate to the vector and to the host cell type. 
For example, commonly used methods such as electroporation, transfection, for 

15 instance, transfection using CaCl 2 , and transduction (as for a virus or bacteriophage) can 
be used. Host cells can be, for example, mammalian cells such as primary culture cells 
or cells of cell lines such as COS cells, 293 cells or Jurkat cells. Host cells can also be, 
in some cases, cells derived from insects, cells of insect cell lines, bacterial cells, such 
as E, coli, or yeast cells, such as S. cerevisiae. It is preferred that the fatty acid transport 

2 0 protein whose function is to be assessed, with or without a candidate inhibitor or 

enhancer, be produced in host cells whose ancestor cells originated in a species related 
to the species of origin of the FATP gene encoding the fatty acid transport protein. For 
example, it is preferable that tests of function or of inhibition or enhancement of a 
mammalian FATP be carried out in host mammalian cells producing the FATP, rather 

2 5 than bacterial cells or yeast cells. 

Host cells comprising a vector comprising a regulatable FATP gene can be 
treated so as to allow expression of the FATP gene and production of the encoded 
protein (e.g., by contacting the cells with an inducer compound that effects transcription 
from an inducible promoter operably linked to the FATP gene). 
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The test agent (e.g., an agonist or antagonist) is added to the cells to be used in a 
fatty acid transport assay, in the presence or absence of test agent, under conditions 
suitable for production and/or maintenance of the expressed FATP in a conformation 
appropriate for association of the FATP with test agent and substrate. For example, 
5 conditions under which an agent is assessed, such as media and temperature 

requirements, can, initially, be similar to those necessary for transport of typical fatty 
acid substrates across the plasma membrane. One of ordinary skill in the art will know 
how to vary experimental conditions depending upon the biochemical nature of the test 
agent. The test agent can be added to the cells in the presence of fatty acid, or in the 
1 0 absence of fatty acid substrate, with the fatty acid substrate being added following the 
addition of the test agent. The concentration at which the test agent can be evaluated 
can be varied, as appropriate, to test for an increased effect with increasing 
concentrations. 

Test agents to be assessed for their effects on fatty acid transport can be any 
1 5 chemical (element, molecule, compound), made synthetically, made by recombinant 
techniques or isolated from a natural source. For example, test agents can be peptides, 
polypeptides, peptoids, sugars, hormones, or nucleic acid molecules, such as antisense 
nucleic acid molecules. In addition, test agents can be small molecules or molecules of 
greater complexity made by combinatorial chemistry, for example, and compiled into 
2 0 libraries. These libraries can comprise, for example, alcohols, alkyl halides, amines, 
amides, esters, aldehydes, ethers and other classes of organic compounds. Test agents 
can also be natural or genetically engineered products isolated from lysates of cells, 
bacterial, animal or plant, or can be the cell lysates themselves. Presentation of test 
compounds to the test system can be in either an isolated form or as mixtures of 
2 5 compounds, especially in initial screening steps. 

Thus, the invention relates to a method for identifying agents which alter fatty 
acid transport, the method comprising providing the test agent to the cell (wherein "cell" 
includes the plural, and can include cells of a cell strain, cell line or culture of primary 
cells or organ culture, for example), under conditions suitable for binding to its target, 
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whether to the FATP itself or to another target on or in the cell, wherein the transformed 
cell comprises a FATP. 

In greater detail, to test one or more agents or compounds (e.g., a mixture of 
compounds can conveniently be screened initially) for inhibition of the transport 
5 function of a fatty acid transport protein, the agent(s) can be contacted with the cells. 
The cells can be contacted with a labeled fatty acid. The fatty acid can be, for example, 
a known substrate of the fatty acid transport protein such as oleate or palmitate. The 
fatty acid can itself be labeled with a radioactive isotope, (e.g., 3 H or 14 C) or can have a 
radioactively labeled adduct attached. In other variations, the fatty acid can have 
1 0 chemically attached to it a fluorescent label, or a substrate for an enzyme occurring 
within the cells, wherein the substrate yields a detectable product, such as a highly 
colored or fluorescent product. Addition of candidate inhibitors and labeled substrate to 
the cells comprising fatty acid transport protein can be in either order or can be 
simultaneous. 

15 A second aliquot of cells, which can be called "control" cells (a "first" aliquot of 

cells can be called "test" cells), is treated, if necessary (as in the case of transformed 
!, host"cells), so as to allow expression of the FATP gene, and is contacted with the 
labeled substrate of the fatty acid transport protein. The second aliquot of cells is not 
contacted with one or more agents to be tested for inhibition of the transport function of 

2 0 the protein produced in the cells, but is otherwise kept under the same culture conditions 
as the first aliquot of cells. 

In a further step of a method to identify inhibitors of a fatty acid transport 
protein, the labeled fatty acid is measured in the first and second aliquots of cells. A 
preliminary step of this measurement process can be to separate the external medium 

2 5 from the cells so as to be able to distinguish the labeled fatty acid external to the cells 
from that which has been transported inside the cells. This can be accomplished, for 
instance, by removing the cells from their growth container, centrifuging the cell 
suspension, removing the supernatant and performing one or more wash steps to 
extensively dilute the remaining medium which may contain labeled fatty acid. 
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Detection of the labeled fatty acid can be by a means appropriate to the label used. For 
example, for a radioactive label, detection can be by scintillation counting of 
appropriately prepared samples of cells (e.g., lysates or protein extracts); for a 
fluorescent label, by measuring fluorescence in the cells by appropriate instrumentation. 

If a compound tested as a candidate inhibitor of transport function causes the test 
cells to have less labeled fatty acid detected in the cells than that detected in the control 
cells, then the compound is an inhibitor of the fatty acid transport protein. Procedures 
analogous to those above can be devised for identifying enhancers (agonists of FATPs) 
of fatty acid transport function wherein if the test cells contain more labeled fatty acid 
than that detected in the control cells, or if the fatty acid is taken up at a higher rate, then 
the compound being tested can be concluded to be an enhancer of the fatty acid 
transport protein. 

Example 13 describes use of an assay of this type to identify an inhibitor of a 
FATP. In Example 13, an antisense oligonucleotide which specifically inhibits 
biosynthesis of mmFATP4 was demonstrated to inhibit fatty acid uptake into mouse 
enterocytes. Similarly, antisense oligonucleotides directed towards specifically 
inhibiting the biosynthesis of FATP6 in heart cells, FATP5 in liver cells, FATP3 in lung 
cells, and FATP2 in colon cells, can be demonstrated as examples of "test agents" that 
inhibit fatty acid transport. 

Another assay to determine whether an agent is an inhibitor (or enhancer) of 
fatty acid transport employs animals, one or more of which are administered the agent, 
and one or more of which are maintained under similar conditions, but are not 
administered the agent. Both groups of animals are given fatty acids (e.g., orally, 
intravenously, by tube inserted into stomach or intestine), and the fatty acids taken up 
into a bodily fluid (e.g., serum) or into an organ or tissue of interest are measured from 
comparable samples taken from each group of animals. The fatty acids may carry a 
label (e.g., radioactive) to facilitate detection and quantitation of fatty acids taken up 
into the fluid or tissue being sampled. This type of assay can be used alone or can be 
used in addition to in vitro assays of a candidate inhibitor or enhancer. 



WHI9721p3MC2 



-48- 

An agent detennined to be an inhibitor (or enhancer) of F ATP function, such as 
fatty acid binding and/or fatty acid uptake, can be administered to cells in culture, or in 
vivo, to a mammal (e.g. human) to inhibit (or enhance) FATP function. Such an agent 
may be one that acts directly on the FATP (for example, by binding) or can act on an 
5 intermediate in a biosynthetic pathway to produce FATP, such as transcription of the 
FATP gene, processing of the mRNA, or translation of the mRNA. An example of such 
an agent is antisense oligonucleotide. 

Antisense methods similar to those illustrated in Example 13 can be used to 
determine the target FATP of a compound or agent that has an inhibitory or enhancing 

1 0 effect on fatty acid uptake. For example, antisense oligonucleotide directed to the 

inhibition of FATP4 biosynthesis can be added to lung cells or cell lines derived from 
lung cells. In addition, antisense oligonucleotides directed to the inhibition of other 
FATPs, except for FATP3, can also be added to the lung cells. The administration of 
antisense oligonucleotides in this manner ensures that the predominant FATP activity 

15 remaining in the cells comes from FATP3. After a period of incubation of the cells with 
the antisense oligonucleotides sufficient to deplete the plasma membrane of the FATPs 
whose biosynthesis has been inhibited, a test agent, preferably one that has been shown 
by some preliminary test to have an inhibitory or enhancing activity on fatty acid 
transport, can be added to the lung cells. If the test agent is now demonstrated, after 

2 0 treatment of the cells with antisense oligonucleotides, to have an inhibitory or 

enhancing activity on fatty acid transport in the lung cells, it can be concluded that the 
target of the test agent is FATP3, or a molecule involved in the biosynthesis or activity 
of FATP3. 

In another type of cell-based assay for uptake of fatty acids, a change of 
2 5 intracellular pH resulting from the uptake of fatty acids can be followed by an indicator 
fluorophore. The fluorophore can be taken up by the cells in a preincubation step. Fatty 
acids can be added to the cell medium, and after some period of incubation to allow 
FATP-mediated uptake of fatty acids, the change in A max of fluorescence can be 
measured, as an indicator of a change in intracellular pH, as the A max of fluorescence of 
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the fluorophore changes with the pH of its environment, thereby indicating uptake of 
fatty acids. One such fluorophore is BCECF (2', 7'-bis(2-carboxyethyl)-5(6)- 
carboxyfluorescein; Rink, TJ. etaL, J. Cell Biol 95: 189 (1982)). 

In assays similar to those described above, a candidate inhibitor or enhancer of 
5 fatty acid transport function can be added (or mock-added, for control cultures) to 
cultures of cells engineered to express a desired FATP to which fatty acid substrate is 
also added. Inhibition of fatty acid uptake is indicated by a lack of the drop in pH, 
indicating fatty acid uptake, that is seen in control cells. Enhancement of fatty acid 
uptake is indicated by a decrease in intracellular pH, as compared to control cells not 

1 0 receiving the candidate enhancer of fatty acid transport function. 

Yeast cells can be used in a similar cell-based assay for the uptake of fatty acids 
mediated by a FATP, and such an assay can be adapted to a screening assay for the 
identification of agents that inhibit or enhance fatty acid uptake by an FATP. Yeast 
cells lacking an endogenous FATP activity (mutated, disrupted or deleted for FAT1; 

15 Faergeman, NJ. et al 9 J. Biol Chem. 272(13):8531-8538 (1997); Watkins, P.A. et al 9 
J. Biol Chem, 273(29):18210-18219 (1998)) can be engineered to harbor a related gene 
of the family of FATP-encoding genes, such as a mammalian FATP (e.g., human 
FATP4). 

Examples of expression vectors include pEG (Mitchell, D.A., et aL 9 Yeast 9:715- 
2 0 723 (1993)) and pDADl and pDAD2, which contain a GAL1 promoter (Davis, L. I. and 
Fink, G. R., Cell 61:965-91% (1990)). A variety of promoters are suitable for 
expression. Available yeast vectors offer a choice of promoters. In one embodiment, 
the inducible GAL1 promoter is used. In another embodiment, the constitutive ADH1 
promoter (alcohol dehydrogenase; Bennetzen, J. L. and Hall, B. D., J. Biol Chem. 
2 5 257:3026-3031 (1982)) can be used to express an inserted gene on glucose-containing 
media. An example of a vector suitable for expression of a heterologous FATP gene in 
yeast is pQB169. 

With the introduced FATP gene providing the only fatty acid transport protein 
function for the yeast cells, it is possible to study effect of the heterologous FATP on 
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fatty acid transport into the yeast cells in isolation. Assays for the uptake of fatty acids 
into the yeast cells can be devised that are similar to those described above and/or those 
assays that have been illustrated in the Examples. Tests for candidate inhibitors or 
enhancers of the heterologous FATP can be done in cultures of yeast cells, wherein the 
5 yeast cells are incubated with fatty acid substrate and an agent to be tested as an 
inhibitor or enhancer of FATP function. FATP uptake after a period of time can be 
measured by analyzing the contents of the yeast cells for fatty acid substrate, as 
compared with control yeast cells incubated with the fatty acid, but not with the test 
agent. Yeast cells have the additional advantage, over mammalian cells in culture, for 

1 0 example, that yeast cells can be forced to rely upon fatty acids as their only source of 
carbon, if the growth medium supplied to the yeast cells is formulated to contain no 
other source of carbon. Thus, the effect of the heterologous FATP on fatty acid uptake 
and metabolism in the engineered yeast cells can be amplified. An agent that efficiently 
blocks transport function of the heterologous FATP could result in death of the yeast 

1 5 cells. Thus, in this case, inhibition of function of the heterologous FATP can result in 
loss of viability. A simple measure of viability is turbidity of the yeast suspension 
culture, which can be adapted to a high throughput screening assay for effects of various 
agents to be tested, using microtiter plates or similar devices for small-volume cultures 
of the engineered yeast cells. 

2 0 Cell-free assays can also be used to measure the transport of fatty acids across a 

membrane, and therefor also to assess a test treatment or test agent for its effect on the 
rate or extent of fatty acid transport. An isolated FATP, for example in the presence of 
a detergent that preserves the native 3-dimensional structure of the FATP, or partially 
purified FATP, can be used in an artificial membrane system typically used to preserve 

25 the native conformation and activity of membrane proteins. Such systems include 
liposomes, artificial bilayers of phospholipids, isolated plasma membrane such as cell 
membrane fragments, cell membrane fractions, or cell membrane vesicles, and other 
systems in which the FATP can be properly oriented within the membrane to have 
transport activity. Assays for transport activity can be performed using methods 



WHI9721p3MC2 



-51- 

analogous to those that can be used in cells engineered to predominantly express one 
FATP whose function is to be measured. A labeled (e.g., radioactively labeled) fatty 
acid substrate can be incubated with one side of a bilayer or in a suspension of 
liposomes constructed to integrate a properly oriented FATP. The accumulation of fatty 
5 acids with time can be measured, using appropriate means to detect the label (e.g., 
scintillation counting of medium on each side of the bilayer, or of the contents of 
liposomes isolated from the surrounding medium). Assays such as these can be adapted 
to use for the testing of agents which might interact with the FATP to produce an 
inhibitory or an enhancing effect on the rate or extent of fatty acid transport. That is, the 

1 0 above-described assay can be done in the presence or absence of the agent to be tested, 
and the results compared. 

For examples of isolation of membrane proteins (ADP/ATP carrier and 
uncoupling protein), reconstitution into phospholipid vesicles, and assays of transport, 
see Klingenberg, M. et al, Methods Enzymol 250:369-389 (1995). For an example of a 

1 5 membrane protein (phosphate carrier of Saccharomyces cerevisiae) that was purified 
and solubilized from E. coli inclusion bodies, see Schroer, A. et al, J. Biol Chem. 273: 
14269-14276 (1998). The Glutl glucose transporter of rat has been expressed in yeast. 
A crude membrane fraction of the yeast was prepared and reconstituted with soybean 
phospholipids into liposomes. Glucose transport activity could be measured in the 

2 0 liposomes (Kasahara, T. and Kasahara, M., J. Biol Chem. 273: 291 13-291 17 (1998)). 
Similar methods can be applied to the proteins and polypeptides of the invention. 

Another embodiment of the invention is a method for inhibiting fatty acid 
uptake in a mammal (e.g., a human), comprising administering to the mammal a 
therapeutically effective amount of an inhibitor of the transport function of one or more 

2 5 of the fatty acid transport proteins, thereby decreasing fatty acid uptake by cells 

comprising the fatty acid protein(s). Where it is desirable to reduce the uptake of fatty 
acids, for example, in the treatment of chronic obesity or as a part of a program of 
weight control or hyperlipidemia control in a human, one or more inhibitors of one or 
more of the fatty acid transport proteins can be administered in an effective dose, and by 
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an effective route, for example, orally, or by an indwelling device that can deliver doses 
to the small intestine. The inhibitor can be one identified by methods described herein, 
or can be one that is, for instance, structurally related to an inhibitor identified by 
methods described herein (e.g., having chemical adducts to better stabilize or solubilize 
5 the inhibitor). The invention further relates to compositions comprising inhibitors of 
fatty acid uptake in a mammal, which may further comprise pharmaceutical carriers 
suitable for administration to a subject mammal, such as sterile solubilizing or 
emulsifying agents. 

A further embodiment of the present invention is a method of enhancing or 

1 0 increasing fatty acid uptake, such as enhancing or increasing LCFA uptake in the small 
intestine (e.g., to treat or prevent a malabsorption syndrome or other wasting condition) 
or in the liver (e.g., by an enhancer of FATP5 transport activity to treat acute liver 
failure) or in the kidney (e.g., by an enhancer of FATP2 transport activity to treat kidney 
failure). In this embodiment, a therapeutically effective amount of an enhancer of the 

1 5 transport function of one or more of the fatty acid transport proteins can be administered 
to a mammalian subject, with the result that fatty acid uptake in the small intestine is 
enhanced. In this embodiment, one or more enhancers of one or more of fatty acid 
transport proteins is administered in an effective dose and by a route (e.g., orally or by a 
device, such as an indwelling catheter or other device) which can deliver doses to the 

2 0 gut. The enhancer of FATP function (e.g., an enhancer of FATP4 function) can be 
identified by methods described herein or can be one that is structurally similar to an 
enhancer identified by methods described herein. 

Aerobic reperfusion of ischemic myocardium is a common clinical event which 
can occur during such treatments as cardiac surgery, angioplasty, and thrombolytic 

2 5 therapy after a myocardial infarction. During reperfusion, a rapid recovery of 

myocardial energy production is essential for the complete recovery of contractile 
function. Not only the extent of recovery of myocardial energy metabolism but also the 
type of energy substrate used by the heart during reperfusion are important determinants 
of functional recovery. Circulating fatty acid levels increase following acute 
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myocardial infarction or during cardiac surgery, such that during and following 
ischemia the heart muscle can be exposed to very high concentrations of fatty acids 
(Lopaschuk, G.D. and W. C. Stanley, Science and Medicine (November/December 
1997)). High plasma fatty acid concentrations increase the severity of ischemic damage 
in a number of experimental models of cardiac ischemia and have been linked to 
depression of mechanical function during aerobic reperfusion of previously ischemic 
hearts. Further data show that modifying fatty acid utilization can be beneficial for 
heart function in ischemia and can be a useful approach for the treatment of angina. 
See, e.g., Desideri and Celegon,^(m. J. Cardiol 82(5A) :50K-53K; Lopaschuk, Am. J. 
Cardiol. 8 2 (5 A): 14K-17K. Plasma fatty acid concentrations can be reduced by 
administering to a human subject or other mammal an effective amount of an inhibitor 
of a FATP such as FATP2 or FATP4, thereby providing a way of reducing fatty acid 
utilization by the heart. 

In a further embodiment of the invention, a therapeutically effective amount of 
an inhibitor of hsFATP6 can be administered to a human patient by a suitable route, to 
reduce the uptake of fatty acids by cardiac muscle. This treatment is desirable in 
patients who are diagnosed as having, or who are at risk of, abnormal accumulations of 
fatty acids in the heart or a detrimentally high rate of uptake of fatty acids into the heart, 
because of ischemic heart disease, or following ischemia or trauma to the heart. 

The invention further relates to antibodies that bind to an isolated or 
recombinant fatty acid transport protein of the FATP family, including portions of 
antibodies, which can specifically recognize and bind to one or more FATPs. The 
antibodies and portions thereof of the invention include those which bind to one or more 
FATPs of mouse or other mammalian species. In a preferred embodiment, the 
antibodies specifically bind to a naturally occurring FATP of humans. The antibodies 
can be used in methods to detect or to purify a protein of the present invention or a 
portion thereof by various methods of immunoaffinity chromatography, to inhibit the 
function of a protein in a method of therapy, or to selectively inactivate an active site, or 
to study other aspects of the structure of these proteins, for example. 
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The antibodies of the present invention can be polyclonal or monoclonal. The 
term antibody is intended to encompass both polyclonal and monoclonal antibodies. 
Antibodies of the present invention can be raised against an appropriate immunogen, 
including proteins or polypeptides of the present invention, such as an isolated or 
recombinant FATP1, FATP2, FATP3, FATP4, FATP5, FATP6, mtFATP, ceFATPa, 
ceFATPb, scFATP or portions thereof, or synthetic molecules, such as synthetic 
peptides (e.g., conjugated to a suitable carrier). Preferred embodiments are antibodies 
that bind to any of the following: hsFATPl, hsFATP2, hsFATP3, hsFATP4, hsFATPS 
or hsFATP6. The immunogen can be a polypeptide comprising a portion of a FATP 
and having at least one function of a fatty acid transport protein, as described herein. 

The term antibody is also intended to encompass single chain antibodies, 
chimeric, humanized or primatized (CDR-grafted) antibodies and the like, as well as 
chimeric or CDR-grafted single chain antibodies, comprising portions from more than 
one species. For example, the chimeric antibodies can comprise portions of proteins 
derived from two different species, joined together chemically by conventional 
techniques or prepared as a single contiguous protein using genetic engineering 
techniques (e.g., DNA encoding the protein portions of the chimeric antibody can be 
expressed to produce a contiguous protein chain. See, e.g., Cabilly et al., U.S. Patent 
No. 4,816,567; Cabilly et al, European Patent No. 0,125,023 Bl; Boss et al, U.S. 
Patent No. 4,816,397; Boss et al, European Patent No. 0,120,694 Bl; Neuberger, M.S. 
et al, WO 86/01533; Neuberger, M.S. et al, European Patent No. 0,194,276 Bl; 
Winter, U.S. Patent No. 5,225,539; Winter, European Patent No. 0,239,400 Bl; Queen 
et al, U.S. Patent No. 5,585,089; and Queen et al, European Patent No. EP 0 451 216 
Bl. See also, Newman, R. et al f BioTechnology, 70.T455-1460 (1992), regarding 
primatized antibody, and Ladner et al, U.S. Patent No. 4,946,778 and Bird, R.E. et al, 
Science, 242:423-426 (1988) regarding single chain antibodies.) 

Whole antibodies and biologically functional fragments thereof are also 
encompassed by the term antibody. Biologically functional antibody fragments which 
can be used include those fragments sufficient for binding of the antibody fragment to a 
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FATP to occur, such as Fv, Fab, Fab' and F(ab') 2 fragments. Such fragments can be 
produced by enzymatic cleavage or by recombinant techniques. For instance, papain or 
pepsin cleavage can generate Fab or F(ab') 2 fragments, respectively. Antibodies can 
also be produced in a variety of truncated forms using antibody genes in which one or 
5 more stop codons have been introduced upstream of the natural stop site. For example, 
a chimeric gene encoding a F(ab') 2 heavy chain portion can be designed to include DNA 
sequences encoding the CHj domain and hinge region of the heavy chain. 

Preparation of immunizing antigen (whole cells comprising FATP on the cell 
surface or purified FATP), and polyclonal and monoclonal antibody production can be 

1 0 performed using any suitable technique. A variety of methods have been described (See 
e.g., Kohler et al, Nature, 256: 495-497 (1975) and Eur. 1 Immunol 6: 51 1-519 
(1976); Milstein et al, Nature 266: 550-552 (1977); Koprowski et al, U.S. Patent No. 
4,172,124; Harlow, E. and D. Lane, 1988, Antibodies: A Laboratory Manual, (Cold 
Spring Harbor Laboratory: Cold Spring Harbor, NY); Chapter 1 1 In Current Protocols 

15 In Molecular Biology, Vol. 2 (containing supplements up through Supplement 42, 

1998), Ausubel, F.M. et al, eds., (John Wiley & Sons: New York, NY)). Generally, a 
hybridoma can be produced by fusing a suitable immortal cell line (e.g., a myeloma cell 
line such as SP2/0) with antibody producing cells. The antibody producing cells, 
preferably those obtained from the spleen or lymph nodes, can be obtained from animals 

2 0 immunized with the antigen of interest. Immunization of animals can be by 

introduction of whole cells comprising fatty acid transport protein on the cell surface. 
The fused cells (hybridomas) can be isolated using selective culture conditions, and 
cloned by limiting dilution. Cells which produce antibodies with the desired specificity 
can be selected by a suitable assay (e.g., ELISA). 

2 5 Other suitable methods of producing or isolating antibodies (including human 

antibodies) of the requisite specificity can used, including, for example, methods which 
select recombinant antibody from a library (e.g., Hoogenboom et al, WO 93/06213; 
Hoogenboom et al, U.S. Patent No. 5,565,332; WO 94/13804, published June 23, 
1994; and Dower, W J. et al, U.S. Patent No. 5,427,908), or which rely upon 
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immunization of transgenic animals (e.g., mice) capable of producing a full repertoire of 
human antibodies (see e.g., Jakobovits et al, Proc. Natl Acad. Set USA, 90: 2551-2555 
(1993); Jakobovits et al, Nature, 562:255-258 (1993); Lonberg et al, U.S. Patent No. 
5,569,825; Lonberg et al, U.S. Patent No. 5,545,806; Surani et al, U.S. Patent No. 
5 5,545,807; and Kucherlapati, R. et al, European Patent No. EP 0 463 151 Bl). 

Another aspect of the invention is a method for directing an agent to cardiac 
muscle. The differential expression of FATP6 in cardiac muscle but not in other tissue 
types allows for the specific targeting of drugs, diagnostic agents, tagging labels, 
histological stains or other substances specifically to cardiac muscle. A targeting 

1 0 vehicle can be used for the delivery of such a substance. Targeting vehicles which bind 
specifically to FATP6 can be linked to a substance to be delivered to the cells of cardiac 
muscle. The linkage can be, for instance, via one or more covalent bonds, or by high 
affinity non-covalent bonds. A targeting vehicle can be an antibody, for instance, or 
other compound (e.g., a fatty acid or fatty acid analog) which binds to FATP6 with high 

15 specificity. 

Targeting vehicles specific to the heart-specific protein FATP6 have in vivo 
(e.g., therapeutic and diagnostic) applications. For example, an antibody which 
specifically binds to FATP6 can be conjugated to a drug to be targeted to the heart (e.g., 
a cardiac glycoside to treat congestive heart failure, or P-adrenergic agents, sodium 

2 0 channel blockers or calcium channel blockers to treat arrhythmias). A substance (e.g., a 
radioactive substance) which can be detected (e.g., a label) in vivo can also be linked to 
a targeting vehicle which specifically binds to a heart-specific protein such as FATP6, 
and the conjugate can be used as a labeling agent to identify cardiac muscle cells. 
Targeting vehicles specific to FATP6 find further applications in vitro. For 

2 5 example, an FATP6-specific targeting vehicle, such as an antibody (a polyclonal 
preparation or monoclonal) which specifically binds to FATP6, can be linked to a 
substance which can be used as a stain for a tissue sample (e.g., horseradish peroxidase) 
to provide a method for the identification of cardiac muscle in a sample, as can be used 
in embryology studies, for example. 
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In a similar manner, an agent can be directed to the liver of a mammal, as 
FATP5 is expressed in liver but not in other tissue types, A targeting vehicle which 
specifically binds to FATP5 can be conjugated to a drug for delivery of the drug to the 
liver, such as a drug to treat hepatitis, Wilson's disease, lipid storage diseases and liver 
cancer. As with targeting vehicles specific to FATP6, targeting vehicles specific to 
FATP5 can be used in studying tissue samples in vitro. 

The invention also relates to compositions comprising a modulator of FATP 
function. The term "modulate" as used herein refers to the ability of a molecule to alter 
the function of another molecule. Thus, modulate could mean, for example, inhibit, 
antagonize, agonize, upregulate, downregulate, induce, or suppress. A modulator has 
the capability of altering function of its target. Such alteration can be accomplished at 
any stage of the transcription, translation, expression or function of the protein, so that, 
for example, modulation of a target gene can be accomplished by modulation of the 
DNA or RNA encoding the protein, and the protein itself. 

Antagonists or agonists (inhibitors or enhancers) of the FATPs of the invention, 
antibodies that bind a FATP, or mimetics of a FATP can be employed in combination 
with a non-sterile or sterile carrier or carriers for use with cells, tissues or organisms, 
such as a pharmaceutical carrier suitable for administration to a mammalian subject. 
Such compositions comprise, for instance, a media additive or a therapeutically 
effective amount of an inhibitor or enhancer compound to be identified by an assay of 
the invention and a pharmaceutically acceptable carrier or excipient. Such carriers may 
include, but are not limited to, saline, buffered saline, dextrose, water, ethanol, 
surfactants, such as glycerol, excipients such as lactose and combinations thereof. The 
formulation can be chosen by one of ordinary skill in the art to suit the mode of 
administration. The chosen route of administration will be influenced by the 
predominant tissue or organ location of the FATP whose function is to be inhibited or 
enhanced. For example, for affecting the function of FATP4, a preferred administration 
can be oral or through a tube inserted into the stomach (e.g., direct stomach tube or 
nasopharyngeal tube), or through other means to accomplish delivery to the small 
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intestine. The invention further relates to diagnostic and pharmaceutical packs and kits 
comprising one or more containers filled with one or more of the ingredients of the 
aforementioned compositions of the invention. 

Compounds of the invention which are FATPs, FATP fusion proteins, FATP 
5 mimetics, FATP gene-specific antisense poly- or oligonucleotides, inhibitors or 

enhancers of a FATP may be employed alone or in conjunction with other compounds, 
such as therapeutic compounds. The pharmaceutical compositions may be administered 
in any effective, convenient manner, including administration by topical, oral, anal, 
vaginal, intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal, 

1 0 transdermal or intradermal routes, among others. In therapy or as a prophylactic, the 
active agent may be administered to an individual as an injectable composition, for 
example as a sterile aqueous dispersion, preferably isotonic. 

Alternatively, the composition may be formulated for topical application, for 
example, in the form of ointments, creams, lotions, eye ointments, eye drops, ear drops, 

15 mouthwash, impregnated dressings and sutures and aerosols, and may contain 

appropriate conventional additives, including, for example, preservatives, solvents to 
assist drug penetration, and emollients in ointments and creams. Such topical 
formulations may also contain compatible conventional carriers, for example cream or 
ointment bases, and ethanol or oleyl alcohol for lotions. 

2 0 In addition, the amount of the compound will vary depending on the size, age, 

body weight, general health, sex, and diet of the host, and the time of administration, the 
biological half-life of the compound, and the particular characteristics and symptoms of 
the disorder to be treated. Adjustment and manipulation of established dose ranges are 
well within the ability of those of skill in the art. 

2 5 A further aspect of the invention is a method to identify a polymorphism, or the 

presence of an alternative or variant allele of a gene in the genome of an organism (of 
interest here, genes encoding FATPs). As used herein, polymorphism refers to the 
occurrence of two or more genetically determined alternative sequences or alleles in a 
population. A polymorphic locus may be as small as a base pair. Polymorphic markers 
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include restriction fragment length polymorphisms, variable number of tandem repeats 
(VNTR's), hypervariable regions, minisatellites, dinucleotide repeats, trinucleotide 
repeats, tetranucleotide repeats, simple sequence repeats, and insertion elements such as 
Alu. The first identified alleleic form, or the most frequently occurring form can be 
5 arbitrarily designated as the reference (usually, "wildtype") form, and other allelic forms 
are designated as alternative (sometimes, "mutant" or "variant"). Dipolid organisms 
may be homozygous or heterozygous for allelic forms. 

An "allele" or "allelic sequence" is an alternative form of a gene which may 
result from at least one mutation in the nucleotide sequence. Alleles may result in 

1 0 altered mRNAs or polypeptides whose structure or function may or may not be altered. 
Any given gene may have none, one, or many allelic forms (polymorphism). Common 
mutational changes which give rise to alleles are generally ascribed to natural deletions, 
additions, or substitutions of nucleotides. Each of these types of changes may occur 
alone, or in combination with the others, one or more times in a given sequence. 

1 5 Several different types of polymorphisms have been reported. A restriction 

fragment length polymorphism (RFLP) is a variation in DNA sequence that alters the 
length ofa restriction fragment (Botsteine* al, Am. J. Hum. Genet 32:314-331 (1980)). 
The restriction fragment length polymorphism may create or delete a restriction site, 
thus changing the length of the restriction fragment. RFLPs have been widely used in 

2 0 human and animal genetic analyses (see WO 90/1 3668; WO 90/1 1 369; Donis-Keller, 
Cell 57:319-337 (1987); Lander et al., Genetics 727:85-99 (1989)). When a heritable 
trait can be linked to a particular RFLP, the presence of the RFLP in an individual can 
be used to predict the likelihood that the individual will also exhibit the trait. 

Other polymorphisms take the form of short tandem repeats (STRs) that include 

2 5 tandem di-, tri- and tetra-nucleotide repeated motifs. These tandem repeats are also 
referred to as variable number tandem repeat (VNTR) polymorphisms. VNTRs have 
been used in identity and paternity analysis (US 5,075,217; Armour et al, FEBSLett 
307:1 13-1 15 (1992); Horn et al, WO 91/14003; Jeffreys, EP 370,719), and in a large 
number of genetic mapping studies. 
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Other polymorphisms take the form of single nucleotide variations between 
individuals of the same species. Such polymorphisms are far more frequent than 
RFLPs, STRs (short tandem repeats) and VNTRs (variable number tandem repeats). 
Some single nucleotide polymorphisms occur in protein-coding sequences, in which 
case, one of the polymorphic forms may give rise to the expression of a defective or 
other variant protein and, potentially, a genetic disease. Other single nucleotide 
polymorphisms occur in noncoding regions. Some of these polymorphisms may also 
result in defective protein expression (e.g., as a result of defective splicing). Other 
single nucleotide polymorphisms have no phenotypic effects. 

Many of the methods described below require amplification of DNA from target 
samples and purification of the amplified products. This can be accomplished by PCR, 
for instance. See generally, PCR Technology, Principles and Applications for DNA 
Amplification (ed. H.A. Erlich), Freeman Press, New York, NY, 1992; PCR Protocols: 
A Guide to Methods and Applications (eds. Innis, et al.), Academic Press, San Diego, 
CA, 1990; Mattila et al, Nucleic Acids Res. 79:4967 (1991); Eckert et al, PCR 
Methods and Applications 1:11 (1991); PCR (eds. McPherson et al, IRS Press, 
Oxford); and US 4,683,202. 

Other suitable amplification methods include the ligase chain reaction (LCR) 
(see Wu and Wallace, Genomics 4:560 (1989); Landegren et al, Science 241:1011 

(1988) ), transcription amplification (Kwoh et al, Proc. Natl Acad, Set USA 86:\\13 

(1989) , self-sustained sequence replication (Guatelli et al, Proc. Natl Acad. Set USA 
57:1874 (1990), and nucleic acid based sequence amplification (NASBA). The latter 
two amplification methods involve isothermal reactions based on isothermal 
transcription, which produce both single stranded RNA (ssRNA) and double stranded 
DNA (dsDNA) as the amplification products in a ratio of about 30 or 100 to 1, 
respectively. 

Another aspect of the invention is a method for detecting a variant allele of a 
human FATP gene, comprising preparing amplified, purified FATP DNA from a 
reference human and amplified, purified, FATP DNA from a "test" human to be 
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compared to the reference as having a variant allele, using the same or comparable 
amplification procedures, and determining whether the reference DNA and test DNA 
differ in DNA sequence in the FATP gene, whether in a coding or a noncoding region, 
wherein, if the test DNA differs in sequence from the reference DNA, the test DNA 
5 comprises a variant allele of a human FATP gene. The following is a discussion of 
some of the methods by which it can be determined whether the reference FATP DNA 
and test FATP DNA differ in sequence. 

Direct Sequencing. The direct analysis of the sequence of variant alleles of the 
present invention can be accomplished using either the dideoxy chain termination 

1 0 method or the Maxam and Gilbert method (see Sambrook et al, Molecular Cloning: A 
Laboratory Manual, 2nd ed., Cold Spring Harbor Press, New York 1989; Zyskind et al., 
Recombinant DNA Laboratory Manual, Acad. Press, 1988)). 

Denaturing Gradient Gel Electrophoresis. Amplification products generated 
using the polymerase chain reaction can be analyzed by the use of denaturing gradient 

15 gel eletrophoresis. Different alleles can be identified based on the different sequence- 
dependent strand dissociation properties and electrophoretic migration of DNA in 
solution (chapter 7 in Erlich, ed. PCR Technology, Principles and Applications for DNA 
Amplification, W.H. Freeman and Co., New York, 1992). 

Single-strand Conformation Polymorphism Analysis. Alleles of target 

2 0 sequences can be differentiated using single-strand conformation polymorphism 

analysis, which identifies base differences by alteration in electrophoretic migration of 
single stranded PCR products, as described in Orita et al, Proc. Natl Acad. Scl USA 
86:21 66-2110 (1989). Amplified PCR products can be generated as described above, 
and heated or otherwise denatured, to form single-stranded amplification products. 

2 5 Single-stranded nucleic acids may refold or form secondary structures which are 

partially dependent on the base sequence. The different electrophoretic mobilities of 
single-stranded amplification products can be related to base-sequence differences 
between alleles of target sequences. 
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Detection of Binding by Protein That Binds to Mismatches. Amplified DNA 
comprising the FATP gene or portion of the gene of interest from genomic DNA, for 
example, of a normal individual is prepared, using primers designed on the basis of the 
DNA sequences provided herein. Amplified DNA is also prepared, in a similar manner, 
5 from genomic DNA of an individual to be tested for bearing a distinguishable allele. 
The primers used in PCR carry different labels, for example, primer 1 with biotin, and 
primer 2 with 32 P. Unused primers are separated form the PCR products, and the 
products are quantitated. The heteroduplexes are used in a mismatch detection assay 
using immobilized mismatch binding protein (MutS) bound to nitrocellulose. The 

1 0 presence of bio tin-labeled DNA wherein mismatched regions are bound to the 

nitrocellulose via MutS protein, is detected by visualizing the binding of streptavidin to 
biotin. See WO 95/12689. MutS protein has also been used in the detection of point 
mutations in a gel-mobility-shift assay (Lishanski, A. et al, Proc. Natl Acad. Set USA 
97:2674-2678 (1994)). 

15 Other methods, such as those described below, can be used to distinguish a 

FATP allele from a reference allele, once a particular allele has been characterized as to 
DNA sequence. 

Allele-specific probes. The design and use of allele-specific probes for 
analyzing polymorphims is described by e.g., Saiki et al, Nature 324:163-166 (1986); 

2 0 Dattagupta, EP 235,726, Saiki, WO 89/1 1 548. Allele-specific probes can be designed 
so that they hybridize to a segment of a target DNA from one individual but do not 
hybridize to the corresponding segment from another individual due to the presence of 
different polymorphic forms in the respective segments from the two individuals. 
Hybridization conditions should be sufficiently stringent that there is a significant 

2 5 difference in hybridization intensity between alleles, and preferably an essentially 

binary response, whereby a probe hybridizes to only one of the alleles. Some probes are 
designed to hybridize to a segment of target DNA such that the polymorphic site aligns 
with a central position (e.g., in a 15-mer at the 7 position; in a 16-mer, at either the 8 or 
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9 position) of the probe. This design of probe achieves good discrimination in 
hybridization between different allelic forms. 

Allele-specific probes are often used in pairs, one member of a pair showing a 
perfect match to a reference form of a target sequence and the other member showing a 
5 perfect match to a variant form. Several pairs of probes can then be immobilized on the 
same support for simultaneous analysis of multiple polymorphisms within the same 
target sequence. 

Allele-specific Primers. An allele-specific primer hybridizes to a site on target 
DNA overlapping a polymorphism, and only primes amplification of an allelic form to 

1 0 which the primer exhibits perfect complementarity. See Gibbs, Nucleic Acid Res. 

1 7:2427-2448 (1989). This primer is used in conjunction with a second primer which 
hybridizes at a distal site. Amplification proceeds from the two primers, resulting in a 
detectable product which indicates the particular allelic form is present. A control is 
usually performed with a second pair of primers, one of which shows a single base 

1 5 mismatch at the polymorphic site and the other of which exhibits perfect 

complementarity to a distal site. The single-base mismatch prevents amplification and 
no detectable product is formed. The method works best when the mismatch is included 
in the 3 '-most position of the oligonucleotide aligned with the polymorphism because 
this position is most destabilizing to elongation from the primer (see, e.g., WO 

2 0 93/22456). 

Gene Chips. Allelic variants can also be identified by hybridization to nucleic 
acids immobilized on solid supports (gene chips), as described, for example, in WO 
95/1 1995 and U.S. Patent No. 5,143,854, both of which are incorporated herein by 
reference. WO 95/1 1995 describes subarrays that are optimized for detection of a 
2 5 characterized variant allele. Such a subarray contains probes designed to be 

complementary to a second reference sequence, which is an allelic variant of the first 
reference sequence. 
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The present method is illustrated by the following examples, which are not 
intended to be limiting in any way. 

EXAMPLES 
Materials and Methods 
5 The following Materials and Methods were used in the work described in 

Examples 1-5, 

Sequence Alignment of F ATP Clones. The DNA sequence for mouse FATP1 
was obtained from the National Center for Biotechnology Information nonredundant 
database. cDNAs for mmFATP2, 3, 4, and 5 were obtained by screening mouse 

1 0 expression libraries (purchased from GIBCO/BRL) with probes derived from the cloned 
expressed sequence tags (ESTs) (Research Genetics, Huntsville, AL). Full-length 
clones were obtained for mmFATP2 and 5 and partial sequences for mmFATP3 and 4. 
The sequences described herein have been deposited in the GenBank database 
(Accession Nos. FATP2, AF072760; FATP3, AF072759; FATP4, AF072758; FATP5, 

15 AF072757). 

Neither FATP2 nor FATP5 contains an in-frame stop codon upstream of the 
putative initiator methionine; initiator methionines were assigned by homology with 
that in mmFATPl and by the presence of a signal sequence immediately after it. The 
Mycobacterium tuberculosis, Caenorhabditis elegans, and Saccharomyces cerevisiae 
2 0 sequences were present in the dbEST database as part of the sequencing projects for 
these organisms. Sequences were aligned utilizing a ClustalX algorithm and the 
resulting alignment exported to SeqVu. Homologous amino acid substitutions are 
boxed in Figure 1 and were determined using the Dayhoff 250 method with a 50% 
homology cutoff. 

2 5 Cell Transfection and LCFA Uptake. COS cells were cotransfected using the 

DEAE-dextran method with the mammalian expression vector pCDNA 3.1 (Invitrogen) 
expressing the gene for CD2 (pCDNA-CD2) in combination with either a pCDNA 3.1 
or pCMVSPORT2 (GIBCO/BRL) expression vector containing one of the murine or 
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nematode FATP genes (pCDNA-mmFATPUpCDNA-FATP2,pCMVSPORT-FATP5, 
pCDNA-ceFATPb). Two days after transfection, cells were assayed for CD2 expression 
with a phycoerythrin-coupled anti-CD2(PE-CD2) monoclonal antibody (PharMingen), 
and fatty acid uptake was assayed with a BODIPY-labeled fatty acid analogue 
5 (Molecular Probes). Briefly, cells were washed twice with PBS (phosphate buffered 
saline) and stained with PE-CD2 at 4°C for 30 min in PBS containing 10% fetal calf 
serum. They were then washed three times with PBS/fetal calf serum for 5 min 
followed by an incubation for 2 min at 37°C in fatty acid uptake solution, which 
contained 0.1 |iM BODIPY-FA and 0.1% fatty acid-free BSA (bovine serum albumin) 

10 in PBS (Schaffer, J.E. & Lodish, H.F. (1994) Cell 79: All -436). After 2 min, the cells 
were washed four times with ice-cold PBS/0.1% BSA. The cells were then removed 
from the plates with PBS containing 5 mM EDTA and resuspended in PBS containing 
10% fetal calf serum and 10 mM EDTA. PE-CD2 and BODIPY-FA fluorescence were 
measured using a FACScan (Becton Dickinson). COS cells were gated on forward 

15 scatter (FSC) and side scatter (SS). Cells exhibiting more than 300 CD2 fluorescence 
units (dsim) representing 15% of all cells were deemed CD2 positive and their 
BODIPY-FA fluorescence was quantitated. 

E. co/z-Based LCFA Uptake Assay. The full-length coding region of mtFATP 
and a control protein, the mammalian transcription factor TFE3, were subcloned into the 

2 0 inducible, prokaryotic expression vector pET (Novagen). Expression was induced with 
1 mM isopropyl |3-D-thiogalactoside (IPTG) for 1 hour, or cells were left uninduced. 
Cells were washed in PBS/0.1% BSA and resuspended in 1 ml PBS/0.1% BSA 
containing 0.1 \lM [ 3 H]palmitate (NEN) at 37°C. Uptake was stopped after the 
indicated incubation time by transferring the cells onto filter paper using a cell harvester 

2 5 (Brandel, Bethesda, MD). Filters were washed extensively with ice-cold PBS/0. 1% 
BSA, and [ 3 H]palmitate was quantitated by scintillation counting. 

Northern Blots. Northern blot analysis of murine FATP expression was done 
using poly(A) mRNA blots (Clontech). Probes of each of the FATPs were derived from 
the y untranslated regions of each gene and were <60% identical in sequence. Probes 
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were labeled by random priming (Boehringer Mannheim) and hybridized at 65°C. 
Blots were extensively washed in 0.2% SSC/0.1% SDS at 65°C. 

Generation of Phylogenetic Trees. Complete and partial sequences for FATP 
genes from human, rat, mouse, puffer fish, Drosophila melanogaster, C. elegans, S. 
5 cerevisiae, and M. tuberculosis were aligned using ClustalX. A homologous region of 
48 amino acids (residues 472-519 in mmFATPl) from all of the genes was used to 
determine phylogenetic relationship within ClustalX. Based on these data a 
phylogenetic tree was generated using Tree View PPC (Figure 5). 

Nomenclature. It is proposed that the FATP genes be given a species specific 
1 0 prefix (mm, Mus musculus; hs, Homo sapiens; mt, M tuberculosis; dm, D. 

melanogaster; ce, C. elegans, sc, S. cerevisiae) and numbered such that mammalian 
homologues in different species share the same number but differ in their prefix. Since 
the two C elegans genes cannot be paired with a specific human or mouse FATP, they 
have been designated ceFATPa and ceFATPb. 

1 5 Example 1 : Identification of Novel Mammalian FATPs 

The National Center for Biotechnology Information EST database was screened, 
using the mouse FATP protein sequence (mmFATPl), to identify novel FATPs. This 
strategy led to the identification of more than 50 murine EST sequences which could be 
assembled into five distinct contiguous DNA sequences (contigs). One contig was 

2 0 identical to the previously cloned FATP, which has been renamed FATP1 . Another, 
which has been renamed FATP2, is the murine homologue of a rat gene previously 
identified by others as a very long chain acyl-CoA synthase (Uchiyama, A., Aoyama, 
T., Kamijo, K., Uchida, Y., Kondo, N., Orii, T. & Hashimoto, T. (1996) Biol Chem, 
271 :30360-30365). The other three contigs represented novel genes (FATP3, 4, and 5). 

2 5 Full-length clones for FATP2 and FATPS and nearly complete sequences for FATP3 
and 4 (Figure 1) were obtained by screening cDNA libraries made from mouse day 
10.5 embryos and adult liver. Also identified were human homologues for each of the 
murine genes in the EST database. A sixth human gene was also identified; whether 
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this gene is also present in the mouse will require additional studies. Map positions are 
given in Tables 2 and 3. 

The genetic loci for all of the human genes, with the exception of FATP5 which 
was already mapped as an unknown EST, were determined using the radiation hybrid 
5 panels. The map positions given below show the distance (in centiRays) from the 
closest framework marker. As a guideline, there are approximately 300kb/cR. 



Table 2. Mapping Data for Human Genes 



hsFATPl 



10 hsFATP2 



hsFATP3 



15 



hsFATP4 

hsFATPS 
hsFATP6 



Chromosome Chrl9 

places 13.35 cR from WI-6344 (lod>3.0) 
Chromosome Chrl5 
places 4.92 cR from D15S126 (lod>3.0) 
Chromosome Chrl 

places 13.24 cR from WI-2862 (lod>3.0) 
Chromosome Chr9 

places 7.80 cR from WI-9685 (lod>3.0) 

unknown EST previously mapped to near D19S418 

Chromosome Chr5 

places 1.41 cRfrom WI-4907 (lod>3.0) 



The mouse map is an internal backcross panel consisting of 188 mouse 
2 0 backcross DNA's plus 4 controls (B6, Spretus, Fl, Water). The backcross was 

constructed by crossing B6 by Spretus animals and then crossing those FTs back to B6. 

Mapping is accomplished by taking advantage of recombinational events during 

meiosis, and the use of PCR primers to detect the differences (by size or re-annealing 

events) at any given locus between the B6 and Spretus allele. 
2 5 For the purposes of mapping, a novel set of primers (gene of interest) is used to 

amplify from all 188 DNA's and then typed as being a B6 ("B") or a Spretus ("S"). This 
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string of B's and S's is entered into the Map Manager program, which does a best fit 
calculation by comparing the string of 188 typings from the gene of interest to all loci 
already extant in the panel, for all 20 chromosomes. The gene of interest is then 
assigned to a particular area on a particular chromosome according to a number of 
5 parameters, including the minimalization of double cross-overs, and the highest LOD 
scores. Indicated in Table 3 are distances to the closest markers on either side of the 
FATP locus. 

Table 3. Mapping Data for Mouse Genes 
Chromosome 8 

places 2.82 cM from D8Mitl32 (lod 43.4) and 1.81 cM from D8Mit74 
God 43.5) 

Chromosome 2 

places 1.29 cM from D2Mit258 (lod 47.9) and 1.75 cM from D2NDS3 
(lod 44.9) 
Chromosome 3 

places 2.54 cM from D3Mit22 (lod 29.5) and 19.62 cM from D3Mit42 
(lod 13.6) 
Chromosome 2 

places 13.78 cM from D2Mitl (lod 22.9) and 3.85 cM from D2Mit65 
(lod 41.9) 
Chromosome 7 

places 7.28 cM proximal of D7Mit21 (lod 28.3) 

Assessment of Function 
The ability of the newly identified mouse genes to function as fatty acid 
2 5 transporters was assessed using a fluorescence-activated cell sorting-based assay. COS 
cells were transiently cotransfected with expression vectors encoding the cell surface 
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protein CD2 and either mmFATPl, mmFATP2, or mmFATP5 5 respectively. Two days 
after transfection, COS cells were stained with an antibody to CD2 and then incubated 
with a BODIPY-labeled fatty acid [BODIPY-FA, (Schaffer, J.E. & Lodish, H.F. (1994) 
Cell 79:427-436)], The cells were then washed extensively, lifted off the dish, and 
5 analyzed by fluorescence-activated cell sorting. As judged by the number of CD2- 
positive cells, the transfection efficiency was approximately 20-30%. Fatty acid uptake 
was quantitated in the transiently transfected COS cells by measuring the BODIPY-FA 
fluorescence of the CD2-positive cells. Expression of CD2 had no effect on fatty acid 
uptake as shown by the finding that COS cells expressing only the transfected CD2 
1 0 cDNA (CD2-positive) had the same low level of BODIPY-FA uptake as did 
untransfected (CD2-negative) control cells (Figure 2A, control). In COS cells 
cotransfected with CD2 and mmFATPl, mmFATP2, or mmFATP5, uptake of 
BODIPY-FA by the transfected (CD2-positive) cells was increased between 15- to 90- 
fold over control (CD2 cDNA only) cells (Figures 2A-2D). 

1 5 Example 3 : Expression Patterns of Murine FATPs 

Expression patterns of members of the murine FATP gene family were 
characterized by Northern blot analysis; to avoid cross-hybridization, the probes used 
were from the 3 1 untranslated region of these genes, which are less than 60% identical in 
sequence. The expression pattern of FATP 1 agrees with that previously found 

2 0 (Schaffer, J.E. & Lodish, H.F. (1994) Cell 79:421-436), Here, expression was seen 
primarily in heart and kidney. FATP2 is expressed almost exclusively in liver and 
kidney, which corresponds to the reported tissue distribution of the rat homologue [very 
long chain acyl-CoA (VLACS)] as assessed by Western blotting (Uchiyama, A., 
Aoyama, T., Kamijo, K., Uchida, Y., Kondo, N., Orii, T. & Hashimoto, T. (1996) 

2 5 Biol Chem. 271 :30360-30365). FATP3 is present in lung, liver, and testis. FATPS is 
expressed only in liver and cannot be detected in other tissues even when the blot is 
overexposed. The human homologue of FATPS is also liver specific and is not 
expressed in a wide array of other tissues tested, including fetal liver. 



WHI9721p3MC2 



-70- 



Example 4: FATPs Are Evolutionary Conserved 

The EST database was searched, using sequences conserved among the five 
murine FATP genes, for FATP genes in other organisms. Two homologues were found 
in C elegans and one in M tuberculosis. One of the C. elegans genes was cloned from 
5 a cDNA library and expressed in COS cells, as described for the murine FATPs. 
Overexpression of the nematode FATP resulted in a 15-fold increase of BODIPY-FA 
uptake compared with control cells (Figure 3). The mycobacterial FATP gene was 
isolated from a phage library and assessed for its ability to facilitate fatty acid uptake. 
E. coli transformed with a prokaryotic, isopropyl P-D-thiogalactoside-inducible 
1 0 expression vector containing the mycobacterial FATP gene demonstrated a significant 
increase in the rate of [ 3 H]palmitate uptake after induction, compared with uninduced 
bacteria or E. coli transformed with a control protein (Figure 4). Novel FATP genes 
were also identified in F. rubripes (puffer fish) and Z). melanogaster. 



ExampleS: Phylogenetic Tree of FATPs 

15 Faergeman et al (Faergeman, N.J., DiRusso. C.C., Elberger, A., Knudsen, J. & 

Black, P. N. (1997) /. Biol Chem. 272:8531-8538) identified three regions of very 
strong conservation between the scFATP and mmFATPl genes. The sequences of the 
FATPS were compared over a 31 1 -amino acid FATP "signature sequence" which 
includes these conserved regions corresponding to amino acids 246-557 in mmFATPl 

2 0 (underlined in Figure 1). When compared with the National Center for Biotechnology 
Information nonredundant database, only one region of the "FATP signature sequence" 
shows significant homology to other proteins. This small stretch of amino acids 
(underlined in Fig. 1) is an AMP-binding motif found in a multitude of other proteins, 
such as acyl-CoA synthase, several CoA lipases, and gramicidin S synthetase 

2 5 component II (Schaffer, J.E. & Lodish, H.F. (1994) Cell 79:427-436). The relevance of 
this motif to fatty acid transport is unclear. Other highly conserved regions among the 
FATPs, including long stretches of amino acids >90% identical from mycobacteria to 
humans, are not found in any other class of proteins. A 48-amino acid segment of the 
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FATP signature sequence was used to construct a phylogenetic tree (Figure 5). Each of 
the human and mouse genes form their own branch; hsFATP6, which as yet has no 
murine homologue, is most closely related to hsFATP3 and mmFATP3. As expected, 
rnVLACS is closer in sequence to mmFATP2 than to hsFATP2. The FATP genes of 
5 invertebrates i.e., C elegans and D. melanogaster, are most closely related to each 
other. Surprisingly, the mycobacteral gene is more closely related to the human and 
mouse FATP 5 genes than to the FATPs of any of the lower organisms. Whether this 
reflects coevolution of the mycobacterial and human genes awaits further study. 

Materials and Methods 
1 0 The following materials and methods were used in the work described in 

Examples 6-10. 

Isolation of full-length human FATP1 and 4 

Full-length clones encoding human FATP1 and human FATP4 were identified 
by searching databases for sequences similar to murine FATP 1-5 coding regions using 

15 the BlastX algorithm (Altschul et al 9 J. Mol Biol 215: 403-410, 1990). 

A concatamer of nucleotide sequences comprising the coding sequences of 
mmFATPl (Genbank Accession U15976), mmFATP2, mmFATP3 (SEQ ID NO:6), 
mmFATP4 (SEQ ID NO:8) and mmFATPS (SEQ ID NO:10) was used to search the 
Millennium database using the BLASTX algorithm. Sequences with a score >1 50 were 

2 0 evaluated for whether they represented known FATP coding sequences. 

Human clones with similarity to the 5' end of murine FATP sequences were 
sequenced completely. Clones encoding full-length human FATP1 were obtained from 
a heart cDNA library constructed in the mammalian expression vector pMET7 
(Tartaglia etal., Cell 83: 1263-1271, 1995). Clones encoding full-length human 

2 5 FATP4 were obtained from a spleen cDNA library constructed in the mammalian 
expression vector pMET7. 
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Isolation of full-length human FATP6 

Several clones encoding human FATP6 were identified by searching public 
databases as described above. Five clones were analyzed further by restriction digestion 
and DNA sequencing. One of these clones (Genbank Accession # AA4 12064) appeared 
5 to be full-length and its entire insert was sequenced. 

DNA Sequence Analysis 

Sequences were aligned with the DNAStar program using the Clustal method. 
Hydrophobicity plots were generated with DNA Strider using the Kyte Doolittle 
method, 

10 In situ hybridization 

Tissues were collected from 8 week old C57/B16 mice. Tissues were fresh 
frozen, cut on a cryostat at 10 |im thickness and mounted on Superfrost Plus slides 
(VWR), Sections were air dried for 20 minutes and then incubated with ice cold 4% 
paraformaldehyde (PFA)/phosphate buffered saline (PBS) for 10 minutes. Slides were 

15 washed 2 times 5 minutes with PBS, incubated with 0.25% acetic anhydride/1 M 
triethanolamine for 10 minutes, washed with PBS for 5 minutes and dehydrated with 
70%, 80%, 95% and 100% ethanol for 1 minute each. Sections were incubated with 
chloroform for 5 minutes. Hybridizations were performed with 35 S-radiolabeled (5x1 0 7 
cpm/ml) cRNA probes generated from the 3 ! untranslated regions of mouse FATPs by 

2 0 PCR followed by in vitro transcription in the presence of 50% formamide, 10% dextran 
sulfate, lx Denhardt's solution, 600 mM NaCl, 10 mM DTT, 0.25% SDS and 10 |ig/ml 
tRNA for 1 8 hours at 55°C. After hybridization, slides were washed with 10 mM Tris- 
HC1 pH 7.6, 500 mM NaCl, 1 mM EDTA (TNE) for 10 minutes, incubated in 40 
jig/ml RNase A in TNE at 37°C for 30 minutes, washed in TNE for 10 minutes, 

2 5 incubated once in 2x SSC at 60°C for 1 hour, once in 0.2x SSC at 60°C for 1 hour, once 
in 0.2x SSC at 65°C for 1 hour and dehydrated with 50%, 70%, 80%, 90% and 100% 
ethanol. Localization of mRNA transcripts was detected by dipping slides in Kodak 
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NBT-2 photoemulsion and exposing for 7 days at 4°C, followed by development with 
Kodak Dektol developer. Slides were counter stained with haematoxylon and eosin and 
photographed. Controls for the in situ hybridization experiments include the use of a 
sense probe which showed no signal above background in all cases. 

5 Northern Blotting 

Human mRNA blots were obtained from Invitrogen or Clontech. PCR fragments 
from the 3 f untranslated regions of human FATPs were used as probes. Blots were 
probed with 32 P-labeled DNA probes using the Rapid-Hyb buffer (Amersham) 
according to the manufacturer's instructions. 

1 0 Cell transfection and LCFA uptake. COS cells were cotransfected, using 

lipofectamine (GIBCO BRL) according to the manufacturer's instructions, with the 
mammalian expression vector pCDNA3.1 (Invitrogen) expressing the gene for CD2 in 
combination with a pMET7 expression vector (Tartaglia et al, Cell, 83:1263-1271, 
1995) containing hsFATPl (pMET7-hsFATP 1 ) or hsFATP4 (pMET7-hsFATP4) or 

15 pMET7 alone. Two days after transfection, cells were assayed for CD2 expression with 
a phycoerythrin-coupled anti-CD2 (PE-CD2) monoclonal antibody (PharMingen), and 
fatty acid uptake was assayed with a BODIPY-labeled fatty acid analog (Molecular 
Probes) as described above. 

Example 6: Determination of Expression of mmFATPs 

2 0 mmFATP4, and to lesser extent mmFATP2, are expressed at high levels in the 

brush border layer of the small intestine. 

Cell transfection and LCFA uptake. COS cells were cotransfected, using 
lipofectamine (GIBCO BRL) according to the manufacturer's instructions, with the 
mammalian expression vector pCDNA3.1 (Invitrogen) expressing the gene for CD2 in 

2 5 combination with a pMET7 expression vector (Tartaglia et al, Cell, 83:1263-1271, 
1995) containing hsFATPl (pMET7-hsF ATP 1 ) or hsFATP4 (pMET7-hsFATP4) or 
pMET7 alone. Two days after transfection, cells were assayed for CD2 expression with 
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a phycoerythrin-coupled anti-CD2 (PE-CD2) monoclonal antibody (PharMingen), and 
fatty acid uptake was assayed with a BODIPY-labeled fatty acid analog (Molecular 
Probes) as described above. 

Absorption of dietary fat requires transport of free fatty acids across the apical 
5 membrane of epithelial cells in the small intestine. Previous studies suggested that this 
transport is protein-mediated; however, the transport protein had not yet been identified. 
In situ hybridization was performed on each of the three regions of the small intestine - 
duodenum, jejunum and ileum as well as the colon, using probes from the 3' 
untranslated regions of mmFATPl, mmFATP2, mmFATP3, mmFATP4 and 

1 0 mmFATPS, to determine whether any of the mouse FATPs are expressed in the small 
intestine. It was expected that a protein involved in fatty acid absorption would be 
expressed in the epithelial cells of the small intestine, but absent from the colon. 

Expression of mmFATPs in the jejunum was identical to that in the ileum in all 
cases. High levels of mmFATP4 mRNA were present in the epithelial cells of the 

15 jejunum and ileum, and lower, but significant, amounts were detected in the epithelial 
cells of the duodenum. Significantly, FATP4 mRNA was absent from other cell types 
of the small intestine and no FATP4 mRNA could be detected in any of the cells of the 
colon. FATP2 mRNA was present in the epithelial cells of the duodenum at a level 
similar to that of FATP4, but was present at lower levels in the jejunum and ileum. No 

2 0 signals above background were detected for mmFATPl , mmFATP3 and mmFATPS in 
any of the intestinal tissues. mmFATP3 and FATPS were clearly detectable by in situ 
hybridization in adult liver and mmFATPl could be detected in a variety of tissues on a 
whole embryo in situ, indicating that the FATP1, 3, and 5 probes were working. 

mmFATP4 expression is predominant in the small intestine compared to the 

2 5 other organs of the mouse embryo. In the small intestine, FATP4 expression is limited 
to differentiated enterocytes, while no signal is detected in the connective tissue or the 
undifferentiated epithelial cells in the crypts. Differentiated enterocytes are known to be 
the cells that mediate the uptake of fatty acids. FATP4 is specifically and strongly 
expressed in the epithelial cells of adult murine duodenum and ileum but not colon. 
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Other FATPs, such as FATPS, are not expressed in the small intestine. Thus, FATP4 is 
the major FATP in the mouse small intestine. Given its high level of expression, it is 
likely that FATP4, and to a lesser extent FATP2, play an important role in the 
absorption of fatty acids. 

5 mmFATP2, and mmFATPS are expressed in hepatocytes 

Northern analysis of mmFATP2, mmFATP3, mmFATP4 and mmFATPS 
showed expression in the liver. To determine whether these proteins are present in 
hepatocytes or other cells types present in liver homogenates, in situ hybridizations 
were performed. mmFATP2, and mmFATPS mRNA was clearly present in 
1 0 hepatocytes, and was not concentrated in other cell types such as endothelial cells or 
macrophages. No signal above background was detected for mmFATPl in any of the 
cell types in the liver, consistent with the results of the Northern blotting. 

Example 7: Isolation and Sequence Analysis of Full-length Human FATP1 and Full- 
length Human FATP4 

15 To identify human cDNA clones encoding FATP family members, Millennium 

databases were searched for sequences similar to murine FATP 1-5 coding regions. Two 
clones were analyzed in detail; inspection of the entire DNA sequence of these two 
clones showed that they encode the human orthologs of mmFATPl and mm FATP4, 
respectively. These two clones were designated hsFATPl and hsFATP4, and their 

2 0 DNA and predicted protein sequences are shown in Figures 44A-44C and 45, and 50A- 
50C and 51. hsFATPl is predicted to encode a 646 amino acid, 71 kD protein with 
multiple membrane-spanning domains (Figure 28A). HsFATP4 is predicted to encode a 
643 amino acid, 72 kD protein with multiple membrane spanning domains (See Figure 
29A). A comparison of the DNA sequences of mouse and human FATP1 and mouse 

2 5 and human FATP4 (Figures 30A-30B and 3 1 A-3 IB) shows that the mouse and human 
orthologs are 85% (FATP1) and 87% (FATP4) identical to each other within the coding 
sequences given in these figures. At the amino acid level, hsFATPl and hsFATP4 are 
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-90% identical to their respective mouse orthologs within the coding region shown in 
these figures (Figures 32 and 33). The sequence identities between mouse and human 
FATP1 and FATP4 are considerably higher than the ones observed between different 
FATP family members within one species (~40%-60%) and are present in the N- 
5 terminal part of the protein, a region that is poorly conserved between different FATP 
family members. This high degree of sequence conservation clearly demonstrates that 
the newly identified human FATPs are orthologs of mouse FATP1 and FATP4 rather 
than novel FATP family members. 

Table 4 is an identity/similarity matrix comparing the amino acid sequences of 
1 0 FATP1 and 4 from human and mouse. This shows that the gene whose sequence is 

shown in Figure 43 A is indeed human FATP4, since it is 91% identical with the murine 
FATP4 but only 62% identical with the closest related human FATP, which is FATP1 . 



Table 4 


Identity/Similarity Matrix 




hsFATP4 


mmFATP4 


hsFATPl 


mmFATPl 


hsFATP4 




93.2 


72.3 


72.0 


mmFATP4 


91.0 




71.2 


71.1 


hsFATPl 


61.9 


61.0 




92.4 


mmFATPl 


60.7 


59.6 


89.5 





Example 8: Isolation and Sequence Analysis of Full-length Human FATP6 
20 A search of EST databases identified a set of overlapping human sequences that 

were similar to FATPs, but did not have a clear mouse ortholog. One of these EST 
clones was found to encode a full-length cDNA. The entire insert of this clone was 
sequenced and designated hsFATP6. The DNA and predicted protein sequences of 
hsFATP6 are shown in Figures 54A-54C and 55. HsFATP6 is predicted to encode a 
25 619 amino acid, 70 kD protein with multiple membrane-spanning domains (Figure 
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3 5 A). A comparison of the amino acid sequences of hsFATP6 with other human 
FATPs shows about 37% identity to either hsFATPl or hsFATP4 (Figure 36). This 
degree of sequence identity is similar to what is observed between different mouse 
FATPs. The phylogenetic analysis described above clearly demonstrates that hsFATP6 
5 is a member of the FATP family, but not an ortholog of any of the mouse FATPs. 

Comparisons were done with "ALIGN" (E. Myers and W. Miller, "Optimal Alignments 
in Linear Space " CABIOS 4:1 1-17 (1988) using standard settings. 

Example 9: Tissue Distribution of Human FATPs 

The tissue distribution of human FATPs was assessed by Northern blotting. 

1 0 Human FATP3 was expressed in a large variety of tissues. In contrast, human FATPS 
was present at high levels in the liver, but was undetectable in all other tissues 
examined. Thus, both hsF ATP3 and hsFATPS recapitulate the expression pattern of 
their mouse orthologs (see above). HsFATP6 is a novel FATP with no mouse ortholog 
as yet. Northern blotting shows that hsFATP6 is expressed at high levels in the heart, 

15 but is undetectable in other tissues, including skeletal and smooth muscle. This tissue 
distribution suggests that human FATP6 performs an important role in energy 
metabolism in the heart; blocking FATP6-mediated fatty acid transport may therefore 
be beneficial for a number of heart diseases, e.g., ischemic heart disease. 

To identify the major FATP expressed in the human small intestine, Northern 

2 0 blotting was performed on a blot containing mRNA from human stomach, jejunum, 
ileum, colon, rectum and lung. hsFATPS and hsFATP6 were undetectable in any of 
these tissues. FATPS is only expressed in liver and FATP6 only in heart. hsFATP2 
was weakly expressed in the colon, and an even weaker signal was detectable in 
jejunum, ileum and lung lanes. hsFATP3 was expressed well in the lung, but was only 

2 5 weakly expressed in the other tissues tested. Importantly, no difference was seen in the 
expression of hsFATP3 between small intestine and stomach or colon, suggesting that 
the expression observed is not related to fatty acid absorption in the small intestine. 
hsFATP4 was clearly expressed in both jejunum and ileum; expression was 
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significantly lower in the colon and was absent in the stomach. This expression pattern 
is consistent with a major role for FATP4 in absorption of fatty acids in the human gut. 

Example 10: Expression of hsFATPl and hsFATP4 Promotes Transport of Fatty Acids 
COS cells were cotransfected using lipofectamine with the mammalian 
5 expression vector pCDNA-CD2 in combination with one of the FATP-containing 

expression vectors (pMET7-hsFATPl or pMET7-hsFATP4) or an insertless expression 
vector (pMET7, control) as described in Materials and Methods for Examples 6-10. 
COS cells were gated on forward scatter and side scatter. Cells exhibiting more than 
400 CD2 fluorescence units representing -30% of all cells were deemed CD2-positive. 
1 0 The percent of CD2-positive cells exhibiting a BODIPY-fluorescence of >300 is plotted 
for the three different vectors tested (Figure 37). 

Example 1 1 : Stable Expression of Human FATP4 in 293 Cells 

Stable cell lines were generated as follows. A DNA fragment containing the 
entire hsFATP4 coding sequence as well as 100 nucleotides of 5' and 50 nucleotides of 

15 3' untranslated region was inserted into the vector pIRES-neo (Clontech) using standard 
cloning techniques. The resulting construct or a vector control (pIRES-neo) was 
transfected into 293 cells using the lipofectamine method (Gibco BRL) according to the 
manufacturer's directions. Cells that had taken up the DNA were selected with 1 mg/ml 
G418 (Gibco BRL). Single colonies were picked 1 to 2 weeks after transfection and 

2 0 grown in medium containing 0.8 mg/ml G41 8. Colonies were screened for the ability to 
take up fatty acids by measuring uptake of a fluorescently labeled fatty acid (BODIPY- 
FA). About 40 colonies transfected with the pIRES-neo containing FATP4 and -20 
colonies transfected with pIRES-neo control were analyzed. All 20 of the vector control 
clones showed amounts of BODIFY-FA uptake similar to each other and to 

2 5 untransfected 293 cells. In contrast, among the 40 FATP4 transfected clones, 3 had a 5- 
to 10-fold increased BODIPY-FA uptake compared to any of the vector controls, and a 
large number (~20) showed an approximately two-fold increase in BODIPY-FA levels. 
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This distribution is consistent with FATP4 conferring increased fatty acid uptake in 
these cells. One of the cell lines with the highest amount of BODEPY-FA uptake was 
selected to be used for measuring uptake of tritiated fatty acid. 

The uptake of tritiated oleate over time by either FATP4 expressing or control 
5 cells was assayed over time. Expression of FATP4 increases the rate of fatty acid 
uptake by over 3 -fold, demonstrating that FATP4 is, like the other FATPs, a functional 
fatty acid transporter (Figure 38). 

Example 12: Immuno-staining with FATP4-Specific Antiserum 

A polyclonal antiserum against the C-terminus of mmFATP4 was raised using a 

1 0 GST-fusion protein having mmFATP4-specific amino acid sequence 552-643 

(AVASP...GEEKL). In western blot experiments, the purified antibody reacted strongly 
with a synthetic peptide matching the C-terminus of mmFATP4, but not with a 
corresponding region of mmFATP2, mmFATP3, or mmFATPS. The mmFATP4 
specific polyclonal antiserum detects, in western blot experiments with enterocyte 

1 5 lysates from 3 different mice, a - 70 kDa protein, which is in accordance with 
mmFATP4's predicted molecular weight of 72 kDa. The binding is specific for 
mmFATP4, since it can be completely abolished by preincubation of the antiserum with 
the GST-fusion peptide used to raise the antibody. 

Immunofluorescence experiments were performed using the anti-mmFATP4 

2 0 antiserum on fresh frozen sections of murine small intestine. The antibody binding 
demonstrates strong expression of mmFATP4 in enterocytes, confirming the results of 
the in situ hybridization experiments. At higher magnifications it is apparent that 
mmFATP4 is expressed at the apical side of the enterocyte, indicating that the 
transporter is present in the brush border membrane, which is known to mediate the 

2 5 uptake of fatty acids from the intestinal lumen. 

Immuno-electron microscopy studies were performed on fresh frozen murine 
intestinal cells. The gold particles used, appearing as black specks on the electron 
micrographs, indicate the subcellular localization of mmFATP4 to be on the microvilli 
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of the enterocyte. It can be seen from electron micrographs that mmFATP4 is localized 
exclusively in membranes, preferentially the apical plasma membrane, confirming that 
it is indeed a membrane protein. 

Methods for Immunofluorescence and Immunogold Electron Microscopy 

Unfixed mouse small intestine was washed with Hank's buffered salt solution 
containing 1 mM EDTA, infused with 2.3 M sucrose solution, and embedded in O.C.T., 
4583 compound. The material was thick sectioned (15 |lM - 40 |lM). The sections 
were washed in PBS containing 1% BSA and 0.075% glycine to block non-specific 
binding. Primary and secondary antibodies were diluted in PBS with 10% FCS and 
incubated for lh. The sections were mounted in 90% glycerol/PBS containing 1 mg/ml 
paraphenylinediamine, and examined with a Bio-Rad MRC 600 confocal, mounted on a 
Zeiss Axioscop. 

For the immunogold labeling, the tissue was fixed with 2% paraformaldehyde in 
PBS for 10 minutes, after which it was cryoprotected by infiltration with 2.3 M sucrose 
in 0.1 M phosphate buffer (pH 7.4) containing 20% polyvinylpyrrolidone, and then 
mounted on aluminum cryo nails and frozen in liquid nitrogen (Tokuyasu, K.T., J. 
Microscop. 743:139-149, 1986). Ultrathin sections were collected on carbon/formvar- 
coated nickel grids. The primary antibody (anti-FATP4) was diluted in 10% FCS in 
PBS and incubated overnight at 4 C, followed by donkey anti-rabbit IgG-gold (12 nm) 
(Jackson Labs) for lh. The sections were stained in 2% neutral uranyl acetate (20 
minutes) and absorption stained with 2% uranyl acetate in 0.2% methylcellulose 
containing 3.2% polyvinyl alcohol. The sections were examined with a Philips EM 410 
electron microscope. 

Example 13: Inhibition of Fatty Acid Uptake Specific to FATP4 Demonstrated in 
Isolated Mouse Enterocytes 

Phosphorothioate derivatives of the following oligonucleotides were 
synthesized: 
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FATP4-S2 



FATP4-AS2 



FATP4-AS2MM 



CCCCCACCAGAGAGGCTCC (SEQ ID NO:103) 
CCACCCCCGGAAAGCCTGC (SEQ ID NO:104) 
GGAGCCTCTCTGGTGGGGG (SEQ ID NO:105) 



10 



15 



20 



FATP4 AS2 is the antisense oligo; it is designed to be complementary to the sequence 
extending from nucleotide 10 to nucleotide 28 of the mouse FATP4 coding sequence. 
FATP4-AS2MM is a control oligo; in the oligo every third nucleotide was changed 
creating mismatches; the overall nucleotide composition is identical to FATP4-AS2 
(same number of G, A, T, C). FATP4-S2 is the sense control. 

Enterocytes were isolated from the small intestine of mice and incubated for 48h 
in tissue culture (Figure 40) either without oligonucleotides (squares) or with 100 (iM 
FATP4 specific sense (circles) or antisense (diamonds) oligonucleotides. The uptake 
over time of 25 |lM oleate was then measured. While the FATP4 sense oligonucleotide 
did not significantly influence the uptake, the antisense oligonucleotide inhibited fatty 
acid uptake by ~ 50%. 

The effect of either FATP4 sense, antisense or mismatch sequence 
oligonucleotides on the uptake of fatty acids was measured in enterocytes. Isolated 
enterocytes were incubated with increasing concentrations of FATP4 antisense 
oligonucleotides (solid bars in Figure 41), or a mismatch control oligonucleotide with 
identical nucleotide composition (stippled bars), or with 100 |IM of the FATP4 sense- 
oligonucleotide (lined bar). The medium for this incubation was Dulbecco ! s modified 
Eagle's medium with 4.5 g/L glucose, 1 mM sodium pyruvate, 0.01 mg/ml human 
transferrin and 10% fetal bovine serum. After 48 hours of incubation the uptake of 
oleate by enterocytes was measured over a 5 minute time interval. Measurements were 
done in quadruplicate. The uptake assay was done in Hank's buffered salt solution with 
10 mM taurocholate. Only the enterocytes given FATP4 antisense oligonucleotide 
showed a concentration dependent decrease of fatty acid uptake, inhibiting it at a 100 
|lM concentration by -50%. This effect was FATP4 specific, since only the antisense 
oligonucleotide which can bind to the FATP4 mRNA and block its translation inhibited 
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uptake, but not a control oligonucleotide differing only in the sequence but not the 
nucleotide content, ruling out a toxic or otherwise nonspecific inhibitory effect of this 
oligonucleotide due to its chemical composition. 

As a further control experiment, the uptake of oleate was measured along with 
5 the uptake of methionine in the same cultured enterocytes. Antisense oligonucleotide, 
mismatch sequence oligonucleotide, or no oligonucleotide was added to a concentration 
of 100 to cultures of enterocytes. After incubation for 48 hours, the uptake of both 
3 H-labeled oleate and 35 S-labeled methionine was assayed. Results are shown in Figure 
42. Fatty acid uptake is at the left side of the paired bars; methionine uptake is on the 
1 0 right side of the paired bars. The fact that amino acid uptake was not influenced by the 
antisense oligonucleotide treatment further supports the conclusion that the antisense 
oligonucleotide causes a specific reduction in translation of FATP4-specific mRNA. 

Example 14: mmFATP2 Is Expressed in Proximal Renal Tubule Epithelium 

Northern analysis showed that mmFATPl, mmFATP2, and mmFATP4 are 

1 5 present in the kidney. In situ hybridization (methods as for Example 6) was performed 
to determine which cell type(s) of the kidney these mRNAs are expressed in. 
mmFATPl mRNA was present in virtually all cells throughout the kidney with no 
obvious preference for a particular cell type. In contrast, mmFATP2 was expressed 
only in the renal cortex. Within the cortex, expression of mmFATP2 was restricted to 

2 0 the epithelial cells of the proximal renal tubules. The primary function of proximal 
renal tubule cells is the reabsorption of filtered salts and nutrients (e.g., glucose), a 
process that requires mitochondrial oxidation and that can utilize fatty acids as energy 
substrates. Based on the localization of mmFATP2, it is possible that mmFATP2 is 
important for reabsorption in the kidney by allowing uptake of an energy source (fatty 

2 5 acids) from the blood into renal epithelial cells. Alternatively, if fatty acids need to be 
reabsorbed in the kidney, similarly to glucose, FATP2 could be involved in the 
reabsorption of fatty acids. Determination of the subcellular localization of FATP2 will 
distinguish between these two possibilities. 
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Example 15: Isolation of full-length human FATP3 

Full-length clones encoding human FATP3 were identified by searching 
databases for sequences similar to the murine FATP1-5 coding regions using the BlastX 
algorithm (Altschul et al, J. Mol. Biol. 215: 403-410, 1990). Human clones with 
5 similarity to the 5' end of murine FATP sequences were sequenced completely. A clone 
encoding full-length human FATP3 was obtained from a human bone library 
constructed in the mammalian expression vector pMET7 (Tartaglia, L.A. et al., Cell 83: 
1263-1271, 1995). To identify human cDNA clones encoding FATP family members, 
databases were searched for sequences similar to murine FATP 1-5 coding regions. One 

1 0 clone was found to encode the human ortholog of mmFATP3 and was designated 

hsFATP3. The DNA and predicted protein sequences of hsFATP3 are shown in Figures 
94A and 94B. hsFATP5 is predicted to encode a 703 amino acid 75.6 kD protein with 
multiple membrane-spanning domains. A comparison of the DNA sequences of mouse 
and human FATP3 shows that the mouse and human orthologs are 81% identical to 

1 5 each other within the coding region. At the amino acid level, hsFATP3 is ~ 86% 
identical to mm FATP3 within the coding region. The sequence identities between 
mouse and human FATP3 are considerably higher than those observed between 
different FATP family members within one species (-40%) and are present in the N- 
terminal part of the protein, a region that is poorly conserved between different FATP 

2 0 family members. 

Example 16: Substrate Specificity of Fatty Acid Transport in hsFATP-Transfected 
Clones 

Using a mammalian expression vector, we generated 40 stable 239 cell lines 
expressing hsFATP4 and 20 cell lines transfected with a control plasmid. The ability of 
2 5 the different cell lines to take up FA, as assessed by uptake assays using the 

fluorescently labeled Bodipy-palmitate, correlated well with their FATP4 expression 
levels determined by Western blotting (FIG. 95). All 20 vector control clones showed 
amounts of Bodipy-FA uptake similar to each other and to untransfected 239 cells. In 
contrast, among the 40 FATP4 transfected clones, a large number (-20) showed an 
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approximately 2-fold increase in Bodipy-FA uptake compared to any of the vector 
controls, and three had a 5- to 10-fold increase in Bodipy-FA uptake. 

Several of the cell lines with the highest amount of Bodipy-FA uptake as well as 
isolated primary enterocytes were used to measure the uptake of radiolabeled FAs. 
5 Short-term uptake by 293 cells and enterocytes of all FAs tested was linear (FIG. 97). 
hsFATP4 expression enhanced the rate of palmitate uptake approximately 3 fold over 
293 cells transfected with vector alone (FIG. 97) and also accelerated the uptake of 
oleate but not of linolate, arachidonate, octanoate, butyrate or cholesterol (Table 6). 
Isolated primary enterocytes showed a similar preference for palmitate and oleate, and 

1 0 absence of transport of arachidonate, octanoate, and butyrate, but displayed a more 
robust transport of linolate and cholesterol than the transfected 293 cells. 

To further characterize the substrate specificity of FATP4, we measured the 
uptake by stably transfected 293 cells of 5 |lM Bodipy-FA in the presence of a 20 fold 
molar excess (i.e., 100 \lM) of FAs, FA-derivatives and lipid soluble vitamins and 

15 hormones. Both saturated and non-saturated fatty acids containing 10 to 26 C atoms 
strongly competed for uptake of Bodipy-palmitate (FIG. 96 and Table 7) and thus are 
presumed to be substrates of FATP4. In contrast, fatty acids with eight of fewer C 
atoms did not compete and thus are presumed not to be FATP4 substrates. Similarly, 
esters of long chain FAs and other hydrophobic molecules tested had no effect on 

2 0 uptake of Bodipy-palmitate. 

LCFA Uptake Assays (Methods) 

Bodipy-FA uptake assays using FACS were performed, adapted to a 96-well 
format. LCFA uptake assays with enterocytes or with stably transfected 293 cells were 
done as follows. Mixed micelles of radiolabeled FA (NEN) and taurocholate (Sigma) in 
2 5 HBS were generated by brief sonication at 37°C. Equal volumes of cells and micelle 
solution were mixed, resulting in a final FA concentration of 25 [iM for antisense 
assays and 10 \xM for substrate specificity assays. Final taurocholate concentration was 
5 mM. Cells were incubated for the indicated amount of time at 37°C. The assay was 
stopped by transferring the cells onto filter paper followed by extensive washes with 
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ice-cold HBS containing 0.1% BSA using a cell harvester (Brandell). Incorporated 
oleate was then determined by |3-scintillation counting (Beckman). 
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Table 6 

Uptake of Different Substrates by FATP4 Expressing Cell Lines and Enterocytes 





Fatty Acid 


293 Cells 
Control* 


293 Cells 

Stably 
Expressing 
FATP4. 


FATP4 
specific 


Enterocytes* 




Palmitate 


564 


1695 


1131 


3036 


5 


Oleate 


662 


1122 


459 


117 




Linolate 


640 


673 


33 


116 




Arachidonate 


3 


5 


2 


0 




Octanoate 


0 


0 


0 


5 




Butyrate 


0 


50 


50 


73 


O io 


Cholesterol 


319 


345 


26 


531 



Uptake of different substrates by enterocytes and by control and 
stable FATP4-expressing 293 cells. The rates of uptake for the 
indicated fatty acids was measured over 4 min taking 



measurements every 30 s. All fatty acids were at a concentration 
15 of 1 0 |JtM in HBS containing 5 mM taurocholate. 

*Uptake measured as pmol/min 10 6 cells 
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Table 7 

Competition of Bodipy-FA Uptake by FATP4 Expressing Cells 



r day aciqs 


Formula 


Competition 


ouiyric /vciQ 


XJ C\ 




^aproic /\ciu 


nun 




uaprync Acid 


f-l TJ n 

<-8 M 16 U 2 




Capric Acid 


r^ 1 TT o 

^10^20^2 


++ 


-Laurie Acia 


IT r\ 
V ^12 ri 24 VJ 2 


++ 


iviynstic Acid 


p TT A 


++ 


x^dlinillC- r\\yiQ. 


P it p| 

v -'16 ri -32 u 2 


-H- 


Stearic Acid 




+ 


Oleic Acid 


C 18 H 34 0 2 


++ 


Linoleic Acid 




++ 


Arachidic Acid 


C20H40O2 


++ 


Lignoceric Acid 


C24H48O2 


++ 


Cerotic Acid 


^26^5202 


++ 



Fatty Acid Derivatives 



Fatty Acids 


Formula 


Competition 


Palmitic Acid Methyl 
Ester 


C 17 H 34 0 2 




Stearic Acid Methyl Ester 


Ci9H 38 0 2 




Oleic Acid Ethyl Ester 


C20FI38O2 




Oleic Acid Oley Ester 






Oleoyl CoA 


C 39 H 68 N 7 0 17 P 3 S 




Cholesteryl Oleate 


C4sH 78 0 2 
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Table 7 Continued 

Competition of Bodipy-FA Uptake by FATP4 Expressing Cells 



Lipid-Soluble Vitamins & Homones 



Fatty Acids 


Formula 


Competition 


Retinoic Acid (Pro- Vitamin A) 


C20H28Q2 


± 


Ergocalciferol (Vitamin D2) 


C28H44O2 




Tocopherol (Vitamin E) 


C29H50O2 




3-Phytylamenadione (Vitamin 
Kl) 






Prostaglandin E2 







Q Competition for Bodipy-FA uptake by FATP4 expressing cells by 

5 n different hydrophobic compounds. The uptake of 5 |iM Bodipy-FA, Cl- 

P Bodipy-C 12 was measured in the presence of a 20-fold molar excess 
5 100 |lM) of the indicated fatty acids or fatty acid derivatives. The 

1 5 maximal 1 00% inhibition was defined as the amount of Bodipy-FA 

fU incorporated in the presence of 200 [iM lauric acid which was on 
average 1 8% ± 5% that of untreated cells. 



-: 0% - 30% inhibition by the indicated substance 
±: 30% - 50% inhibition 
2 0 +: 50% - 70% inhibition 

++: 70% - 100% inhibition 
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All references cited herein are incorporated by reference in their entirety. 

While this invention has been particularly shown and described with references 
to preferred embodiments thereof, it will be understood by those skilled in the art that 
various changes in form and details may be made therein without departing from the 
5 spirit and scope of the invention as defined by the appended claims. 
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CLAIMS 



What is claimed is: 



A method for identifying an agent which binds to a protein comprising an amino 
acid sequence of SEQ ID NO:49 or SEQ ID NO:53, comprising the steps of 
contacting the agent with the isolated protein under conditions appropriate for 
binding of the agent to the isolated protein, and detecting a resulting agent- 
protein complex. 



2. The method of Claim 1 wherein the step of contacting the agent with isolated 
protein is performed in an artificial membrane system. 

10 3 . The method of Claim 1 wherein the isolated protein is in isolated plasma 
membrane. 



4. A method for identifying an agent which inhibits interaction between an isolated 
protein comprising amino acid sequence SEQ ID NO:49, or SEQ ID NO:53, and 
further comprising a ligand of said protein, comprising: 
15 (a) combining: 

( 1 ) said isolated protein; 

(2) the ligand of said protein; and 

(3) a candidate agent to be assessed for its ability to inhibit 
interaction between said protein of (1) and the ligand of (2), 

2 0 under conditions appropriate for interaction between the said 

protein of (1) and the ligand of (2); 

(b) determining the extent to which said protein of (1) and the ligand of (2) 
interact; and 

(c) comparing the extent determined in (b) with the extent to which 

2 5 interaction of said protein of (1) and the ligand of (2) occurs in the 
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absence of the candidate agent to be assessed and under the same 
conditions appropriate for interaction of said protein of (1) with the 
ligandof(2); 

wherein if the extent to which interaction of said protein of (1) and the ligand of 
(2) occurs is less in the presence of the candidate agent than in the absence of the 
candidate agent, the candidate agent is an agent which inhibits interaction 
between said protein and the ligand of said protein. 

The method of Claim 4 wherein (a) is performed in an artificial membrane 
system. 

The method of Claim 4 wherein said isolated protein is in isolated plasma 
membrane. 

A method for identifying an agent which binds to a protein, said protein encoded 
by (1) a polynucleotide comprising a nucleotide sequence which encodes a 
naturally occurring allelic variant of a polypeptide consisting of the amino acid 
sequence of FATP2, wherein said polynucleotide hybridizes to a complement of 
a polynucleotide consisting of SEQ ID NO:48 under high stringency conditions, 
or by (2) a polynucleotide comprising a nucleotide sequence which encodes a 
naturally occurring allelic variant of a polypeptide consisting of the amino acid 
sequence of FATP4, wherein said polynucleotide hybridizes to a complement of 
a polynucleotide consisting of SEQ ID NO:52 under high stringency conditions, 
comprising the steps of isolating the protein, contacting the agent with the 
isolated protein under conditions appropriate for binding of the agent to the 
isolated protein, and detecting a resulting agent-protein complex. 

The method of Claim 7 wherein the step of contacting the agent with the isolated 
protein is performed in an artificial membrane system. 
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The method of Claim 7 wherein the isolated protein is in isolated plasma 
membrane. 

A method for identifying an agent which inhibits interaction between (1) an 
isolated protein, said protein being encoded by (i) a polynucleotide comprising a 
nucleotide sequence which encodes a naturally occurring allelic variant of a 
polypeptide consisting of the amino acid sequence of FATP2, wherein said 
polynucleotide hybridizes to a complement of a polynucleotide consisting of 
SEQ ID NO:48 under high stringency conditions, or by (ii) a polynucleotide 
having a nucleotide sequence which encodes a naturally occurring allelic variant 
of a polypeptide consiting of the amino acid sequence of FATP4, wherein said 
polynucleotide hybridizes to a complement of a polynucleotide consisting of 
SEQ ID NO:52 under high stringency conditions and (2) a ligand of said protein, 
comprising: 

(a) combining: 

(1) said isolated protein; 

(2) the ligand of said protein; and 

(3) a candidate agent to be assessed for its ability to inhibit 
interaction between said protein of (1) and the ligand of (2), 
under conditions appropriate for interaction between said protein 
of (1) and the ligand of (2); 

(b) determining the extent to which said protein of (1) and the ligand of (2) 
interact; and 

(c) comparing the extent determined in (b) with the extent to which 
interaction of said protein of (1) and the ligand of (2) occurs in the 
absence of the candidate agent to be assessed and under the same 
conditions appropriate for interaction of said protein of (1) with the 
ligand of (2); 

wherein if the extent to which interaction of said protein of (1) and the ligand of 
(2) occurs is less in the presence of the candidate agent than in the absence of the 
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candidate agent, the candidate agent is an agent which inhibits interaction 
between said protein and the ligand of said protein. 

The method of Claim 10 wherein (a) is performed in an artificial membrane 
system. 

The method of Claim 10 wherein said isolated protein is in isolated plasma 
membrane. 

A method for identifying an agent which binds to a protein encoded by a nucleic 
acid encoding a fatty acid transport protein comprising an amino acid sequence 
sharing at least about 95% amino acid sequence similarity with the amino acid 
sequence in SEQ ID NO:49, or SEQ ID NO:53 comprising the steps of isolating 
the protein, contacting the agent with the isolated protein under conditions 
appropriate for binding of the agent to the isolated protein, and detecting a 
resulting agent-protein complex. 

The method of Claim 13 wherein the step of contacting the agent with isolated 
protein is performed in an artificial membrane system. 

The method of Claim 13 wherein the isolated protein is in isolated plasma 
membrane. 

A method for identifying an agent which inhibits interaction between (i) an 
isolated protein encoded by a nucleic acid encoding a fatty acid transport protein 
comprising an amino acid sequence sharing at least about 90% amino acid 
sequence similarity with the amino acid sequence in SEQ ID NO:49, or (ii) a 
protein encoded by a nucleic acid encoding a fatty acid transport protein 
comprising an amino acid sequence sharing at least about 90% amino acid 
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sequence similarity with the amino acid sequence in SEQ ID NO:53 and a ligand 
of said protein, said method comprising: 
(a) combining: 

( 1 ) said isolated protein; 
5 (2) the ligand of said protein; and 

(3) a candidate agent to be assessed for its ability to inhibit 

interaction between said protein of (1) and the ligand of (2), 
under conditions appropriate for interaction between the said 
protein of (1) and the ligand of (2); 
1 0 (b) determining the extent to which said protein of (1) and the ligand of (2) 

interact; and 

(c) comparing the extent determined in (b) with the extent to which 

interaction of said protein of (1) and the ligand of (2) occurs in the 
absence of the candidate agent to be assessed and under the same 
1 5 conditions appropriate for interaction of said protein of ( 1 ) with the 

ligand of (2); 

wherein if the extent to which interaction of said protein of (1) and the ligand of 
(2) occurs is less in the presence of the candidate agent than in the absence of the 
candidate agent, the candidate agent is an agent which inhibits interaction 
2 0 between said protein and the ligand of said protein. 

17. The method of Claim 16 wherein (a) is performed in an artificial membrane 
system. 

18. The method of Claim 16 wherein said isolated protein is in isolated plasma 
membrane. 



25 19. 



A method for identifying an agent which is an inhibitor of fatty acid uptake by 
(i) a protein encoded by a polynucleotide comprising a nucleotide sequence 
which encodes a protein consisting of the amino acid sequence in SEQ ID 
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NO:49, or by (ii) a protein encoded by a polynucleotide comprising a nucleotide 
sequence which encodes a protein consisting of the amino acid sequence in SEQ 
ID NO:53 ? comprising the steps of: 

a) maintaining test cells expressing said polynucleotide in the presence of a 
fatty acid and an agent to be tested as an inhibitor of fatty acid uptake; 

b) measuring uptake of the fatty acid in the test cells; and 

c) comparing uptake of the fatty acid in the test cells with uptake of the 
fatty acid in suitable control cells; 

wherein lower uptake of the fatty acid in the test cells compared to uptake of the 
fatty acid in the control cells is indicative that the agent is an inhibitor of fatty 
acid uptake by said protein. 

An inhibitor of fatty acid uptake identified by the method of Claim 19. 

The method of Claim 19 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from said test animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; 

d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

An inhibitor of fatty acid uptake identified by the method of Claim 21. 

A method for identifying an agent which is an inhibitor of fatty acid uptake by a 
protein, said protein encoded by (i) a polynucleotide comprising a nucleotide 
sequence which encodes a naturally occurring allelic variant of a polypeptide 
consisting of the amino acid sequence of FATP2, wherein said polynucleotide 
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hybridizes to a complement of a polynucleotide consisting of SEQ ID NO:48 
under high stringency conditions, or by (ii) a polynucleotide comprising a 
nucleotide sequence which encodes a naturally occurring allelic variant of a 
polypeptide consisting of the amino acid sequence of FATP4, wherein said 
5 polynucleotide hybridizes to a complement of a polynucleotide consisting of 

SEQ ID NO: 52 under high stringency conditions, comprising the steps of: 

a) maintaining test cells expressing said polynucleotide in the presence of a 
fatty acid and an agent to be tested as an inhibitor of fatty acid uptake; 

b) measuring uptake of the fatty acid in the test cells; and 

10 c) comparing uptake of the fatty acid in the test cells with uptake of the 

fatty acid in suitable control cells; 
wherein lower uptake of the fatty acid in the test cells compared to uptake of the 
fatty acid in the control cells is indicative that the agent is an inhibitor of fatty 
acid uptake by said protein. 

15 24. An inhibitor of fatty acid uptake identified by the method of Claim 23 . 

25. The method of Claim 23 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from said test animals; 

2 0 c) measuring exogenously supplied fatty acids in one or more comparable 

samples of tissue or bodily fluid from suitable control animals; 
d) comparing the fatty acids of b) with the fatty acids of c); 
whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 



2 5 26. 



An inhibitor of fatty acid uptake identified by the method of Claim 25. 
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A method for identifying an agent which is an inhibitor of fatty acid uptake by a 
protein, said protein being encoded by (i) a nucleic acid encoding a fatty acid 
transport protein comprising an amino acid sequence sharing at least about 95% 
amino acid sequence similarity with the amino acid sequence in SEQ ID NO:49 
or by (ii) a nucleic acid encoding a fatty acid transport protein comprising an 
amino acid sequence sharing at least about 95% amino acid sequence similarity 
with the amino acid sequence in SEQ ID NO:53, comprising the steps of: 

a) maintaining test cells expressing said polynucleotide in the presence of a 
fatty acid and an agent to be tested as an inhibitor of fatty acid uptake; 

b) measuring uptake of the fatty acid in the test cells; and 

c) comparing uptake of the fatty acid in the test cells with uptake of the 
fatty acid in suitable control cells; 

wherein lower uptake of the fatty acid in the test cells compared to uptake of the 
fatty acid in the control cells is indicative that the agent is an inhibitor of fatty 
acid uptake by said protein. 

An inhibitor of fatty acid uptake identified by the method of Claim 27. 

The method of Claim 27 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from said test animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; 

d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

An inhibitor of fatty acid uptake identified by the method of Claim 27. 
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A method for identifying an agent which is an inhibitor of (i) a protein encoded 
by a polynucleotide comprising a nucleotide sequence which encodes a protein 
comprising the amino acid sequence in SEQ ED NO:49 or (ii) a protein encoded 
by a polynucleotide comprising a nucleotide sequence which encodes a protein 
comprising the amino acid sequence in SEQ ID NO:53, comprising the steps of: 

(a) introducing into host cells one or more vectors comprising a 
polynucleotide expressing said protein; 

(b) culturing a first aliquot of the host cells with fatty acid substrate of said 
protein and with an agent being tested as an inhibitor of said protein; 

(c) culturing a second aliquot of the host cells with fatty acid substrate of 
said protein; 

(d) measuring, in the first and second aliquots, uptake of the fatty acid 
substrate of the host cells; 

wherein less uptake of the fatty acid substrate in the first aliquot compared to 
the second aliquot is indicative that the agent is an inhibitor of said protein. 

The method of Claim 3 1 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from suitable control animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; and 

d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

A method for identifying an agent which is an inhibitor of a protein, said protein 
being encoded by (i) a polynucleotide comprising a nucleotide sequence which 
encodes a naturally occurring allelic variant of a polypeptide consisting of the 
amino acid sequence of FATP2, wherein said polynucleotide hybridizes to a 
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complement of a polynucleotide consisting of SEQ ID NO:48 under high 
stringency conditions, or by (ii) a polynucleotide comprising a nucleotide 
sequence which encodes a naturally occurring allelic variant of a polypeptide 
consisting of the amino acid sequence of FATP4, wherein said polynucleotide 
hybridizes to a complement of a polynucleotide consisting of SEQ ED NO:52 
under high stringency conditions, comprising the steps of: 

(a) introducing into host cells one or more vectors comprising a 
polynucleotide expressing said protein; 

(b) culturing a first aliquot of the host cells with fatty acid substrate of said 
protein and with an agent being tested as an inhibitor of said protein; 

(c) culturing a second aliquot of the host cells with fatty acid substrate of 
said protein; 

(d) measuring, in the first and second aliquots, uptake of the fatty acid 
substrate of the host cells; 

wherein less uptake of the fatty acid substrate in the first aliquot compared to 
the second aliquot is indicative that the agent is an inhibitor of said protein. 

The method of Claim 33 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from suitable control animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; and 

d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

A method for identifying an agent which is an inhibitor of a protein, said protein 
being encoded by (i) a nucleic acid encoding a fatty acid transport protein 
comprising an amino acid sequence sharing at least about 95% amino acid 
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sequence similarity with the amino acid sequence in SEQ ID NO:49 or by (ii) a 
nucleic acid encoding a fatty acid transport protein comprising an amino acid 
sequence sharing at least about 95% amino acid sequence similarity with the 
amino acid sequence in SEQ ID NO:53, comprising the steps of: 

(a) introducing into host cells one or more vectors comprising a 
polynucleotide expressing said protein; 

(b) culturing a first aliquot of the host cells with fatty acid substrate of said 
protein and with an agent being tested as an inhibitor of said protein; 

(c) culturing a second aliquot of the host cells with fatty acid substrate of 
said protein; 

(d) measuring, in the first and second aliquots, uptake of the fatty acid 
substrate of the host cells; 

wherein less uptake of the fatty acid substrate in the first aliquot compared to the 
second aliquot is indicative that the agent is an inhibitor of said protein. 

The method of Claim 35 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from suitable control animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; and 

d) comparing the fatty acids of b) with the fatty acids of c). 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

A method for identifying an agent which binds to a protein comprising an amino 
acid sequence of SEQ ID NO:57, comprising the steps of contacting the agent 
with the isolated protein under conditions appropriate for binding of the agent to 
the isolated protein, and detecting a resulting agent-protein complex. 
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The method of Claim 37 wherein the step of contacting the agent with isolated 
protein is performed in an artificial membrane system. 

The method of Claim 37 wherein the isolated protein is in isolated plasma 
membrane. 

A method for identifying an agent which inhibits interaction between an isolated 
protein comprising an amino acid sequence of SEQ ID NO:57, and further 
comprising a ligand of said protein, comprising: 

(a) combining: 

(1 ) said isolated protein; 

(2) the ligand of said protein; and 

(3) a candidate agent to be assessed for its ability to inhibit 
interaction between said protein of (1) and the ligand of (2), 
under conditions appropriate for interaction between the said 
protein of (1) and the ligand of (2); 

(b) determining the extent to which said protein of (1) and the ligand of (2) 
interact; and 

(c) comparing the extent determined in (b) with the extent to which 
interaction of said protein of (1) and the ligand of (2) occurs in the 
absence of the candidate agent to be assessed and under the same 
conditions appropriate for interaction of said protein of (1) with the 
ligand of (2); 

wherein if the extent to which interaction of said protein of (1) and the ligand of 
(2) occurs is less in the presence of the candidate agent than in the absence of the 
candidate agent, the candidate agent is an agent which inhibits interaction 
between said protein and the ligand of said protein. 

The method of Claim 40 wherein (a) is performed in an artificial membrane 
system. 
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42. The method of Claim 40 wherein said isolated protein is in isolated plasma 
membrane. 

43. A method for identifying an agent which binds to a protein, said protein encoded 
by a polynucleotide comprising a nucleotide sequence which encodes a naturally 

5 occurring allelic variant of a polypeptide consisting of the amino acid sequence 

of FATP6, wherein said polynucleotide hybridizes to a complement of a 
polynucleotide consisting of SEQ ID NO:56 under high stringency conditions, 
comprising the steps of isolating the protein, contacting the agent with the 
isolated protein under conditions appropriate for binding of the agent to the 
1 0 isolated protein, and detecting a resulting agent-protein complex. 

44. The method of Claim 43 wherein the step of contacting the agent with the 
isolated protein is performed in an artificial membrane system. 

45. The method of Claim 43 wherein the isolated protein is in isolated plasma 
membrane. 

1 5 46. A method for identifying an agent which inhibits interaction between (1) an 
isolated protein, said protein encoded by a polynucleotide comprising a 
nucleotide sequence which encodes a naturally occurring allelic variant of a 
polypeptide consisting of the amino acid sequence of FATP6, wherein said 
polynucleotide hybridizes to a complement of a polynucleotide consisting of 

2 0 SEQ ID NO: 56 under high stringency conditions, and (2) a ligand of said 

protein, comprising: 
(a) combining: 

( 1 ) said isolated protein; 

(2) the ligand of said protein; and 

25 (3) a candidate agent to be assessed for its ability to inhibit 

interaction between said protein of (1) and the ligand of (2), 



WHI9721p3MC2 



-105- 

under conditions appropriate for interaction between said protein 
of (1) and the ligand of (2); 
(b) determining the extent to which said protein of (1) and the ligand of (2) 
interact; and 

5 (c) comparing the extent determined in (b) with the extent to which 

interaction of said protein of (1) and the ligand of (2) occurs in the 
absence of the candidate agent to be assessed and under the same 
conditions appropriate for interaction of said protein of (1) with the 
ligand of (2); 

1 0 wherein if the extent to which interaction of said protein of (1) and the ligand of 

(2) occurs is less in the presence of the candidate agent than in the absence of the 
candidate agent, the candidate agent is an agent which inhibits interaction 
between said protein and the ligand of said protein. 

47. The method of Claim 46 wherein (a) is performed in an artificial membrane 
15 system. 

48. The method of Claim 46 wherein the isolated protein is in isolated plasma 
membrane. 

49. A method for identifying an agent which binds to a protein encoded by a nucleic 
acid encoding a fatty acid transport protein consisting of an amino acid sequence 

2 0 sharing at least about 95% amino acid sequence similarity with the amino acid 

sequence in SEQ ID NO: 57 comprising the steps of isolating the protein, 
contacting the agent with the isolated protein under conditions appropriate for 
binding of the agent to the isolated protein, and detecting a resulting agent- 
protein complex, 

2 5 50. The method of Claim 49 wherein the step of contacting the agent with isolated 
protein is performed in an artificial membrane system. 
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The method of Claim 49 wherein the isolated protein is in isolated plasma 
membrane. 

A method for identifying an agent which inhibits interaction between an isolated 
protein encoded by a nucleic acid encoding a fatty acid transport protein 
comprising an amino acid sequence sharing at least about 90% amino acid 
sequence similarity with the amino acid sequence in SEQ ID NO:57 and a ligand 
of said protein, said method comprising: 

(a) combining: 

( 1 ) said isolated protein; 

(2) the ligand of said protein; and 

(3) a candidate agent to be assessed for its ability to inhibit 
interaction between said protein of (1) and the ligand of (2), 
under conditions appropriate for interaction between the said 
protein of (1) and the ligand of (2); 

(b) determining the extent to which said protein of (1) and the ligand of (2) 
interact; and 

(c) comparing the extent determined in (b) with the extent to which 
interaction of said protein of (1) and the ligand of (2) occurs in the 
absence of the candidate agent to be assessed and under the same 
conditions appropriate for interaction of said protein of (1) with the 
ligand of (2); 

wherein if the extent to which interaction of said protein of (1) and the ligand of 
(2) occurs is less in the presence of the candidate agent than in the absence of the 
candidate agent, the candidate agent is an agent which inhibits interaction 
between said protein and the ligand of said protein. 

The method of Claim 52 wherein (a) is performed in an artificial membrane 
system. 
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The method of Claim 52 wherein said isolated protein is in isolated plasma 
membrane. 

A method for identifying an agent which is an inhibitor of fatty acid uptake by a 
protein encoded by a polynucleotide comprising a nucleotide sequence which 
encodes a protein consisting of the amino acid sequence in SEQ ID NO:57, 
comprising the steps of: 

a) maintaining test cells expressing said polynucleotide in the presence of a 
fatty acid and an agent to be tested as an inhibitor of fatty acid uptake; 

b) measuring uptake of the fatty acid in the test cells; and 

c) comparing uptake of the fatty acid in the test cells with uptake of the 
fatty acid in suitable control cells; 

wherein lower uptake of the fatty acid in the test cells compared to uptake of the 
fatty acid in the control cells is indicative that the agent is an inhibitor of fatty 
acid uptake by said protein. 

An inhibitor of fatty acid uptake identified by the method of Claim 55. 

The method of Claim 55 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from said test animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; 

d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

An inhibitor of fatty acid uptake identified by the method of Claim 57. 
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A method for identifying an agent which is an inhibitor of fatty acid uptake by a 
protein, said protein encoded by a polynucleotide comprising a nucleotide 
sequence which encodes a naturally occurring allelic variant of a polypeptide 
consisting of the amino acid sequence of FATP6, wherein said polynucleotide 
hybridizes to a complement of a polynucleotide consisting of SEQ ID NO: 56 
under high stringency conditions, comprising the steps of: 

a) maintaining test cells expressing said polynucleotide in the presence of a 
fatty acid and an agent to be tested as an inhibitor of fatty acid uptake; 

b) measuring uptake of the fatty acid in the test cells; and 

c) comparing uptake of the fatty acid in the test cells with uptake of the 
fatty acid in suitable control cells; 

wherein lower uptake of the fatty acid in the test cells compared to uptake of the 
fatty acid in the control cells is indicative that the agent is an inhibitor of fatty 
acid uptake by said protein. 

An inhibitor of fatty acid uptake identified by the method of Claim 59. 

The method of Claim 59 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from said test animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; 

d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

An inhibitor of fatty acid uptake identified by the method of Claim 61. 
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A method for identifying an agent which is an inhibitor of fatty acid uptake by a 
protein, said protein being encoded by a nucleic acid encoding a fatty acid 
transport protein comprising an amino acid sequence sharing at least about 95% 
amino acid sequence similarity with the amino acid sequence in SEQ ID NO:57, 
comprising the steps of: 

a) maintaining test cells expressing said polynucleotide in the presence of a 
fatty acid and an agent to be tested as an inhibitor of fatty acid uptake; 

b) measuring uptake of the fatty acid in the test cells; and 

c) comparing uptake of the fatty acid in the test cells with uptake of the 
fatty acid in suitable control cells; 

wherein lower uptake of the fatty acid in the test cells compared to uptake of the 
fatty acid in the control cells is indicative that the agent is an inhibitor of fatty 
acid uptake by said protein. 

An inhibitor of fatty acid uptake identified by the method of Claim 63. 

The method of Claim 63 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from said test animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; 

d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

An inhibitor of fatty acid uptake identified by the method of Claim 65. 
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A method for identifying an agent which is an inhibitor of a protein encoded by 
a polynucleotide comprising a nucleotide sequence which encodes a protein 
comprising the amino acid sequence in SEQ ID NO:57, comprising the steps of: 

(a) introducing into host cells one or more vectors comprising a 
polynucleotide expressing said protein; 

(b) culturing a first aliquot of the host cells with fatty acid substrate of said 
protein and with an agent being tested as an inhibitor of said protein; 

(c) culturing a second aliquot of the host cells with fatty acid substrate of 
said protein; 

(d) measuring, in the first and second aliquots, uptake of the fatty acid 
substrate of the host cells; 

wherein less uptake of the fatty acid substrate in the first aliquot compared to 
the second aliquot is indicative that the agent is an inhibitor of said protein. 

The method of Claim 67 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from suitable control animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; and 

d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

A method for identifying an agent which is an inhibitor of a protein, said protein 
being encoded by a polynucleotide comprising a nucleotide sequence which 
encodes a naturally occurring allelic variant of a polypeptide consisting of the 
amino acid sequence of FATP6, wherein said polynucleotide hybridizes to a 
complement of a polynucleotide consisting of SEQ ID NO:56 under high 
stringency conditions, comprising the steps of: 
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(a) introducing into host cells one or more vectors comprising a 
polynucleotide expressing said protein; 

(b) culturing a first aliquot of the host cells with fatty acid substrate of said 
protein and with an agent being tested as an inhibitor of said protein; 

(c) culturing a second aliquot of the host cells with fatty acid substrate of 
said protein; 

(d) measuring, in the first and second aliquots, uptake of the fatty acid 
substrate of the host cells; 

wherein less uptake of the fatty acid substrate in the first aliquot compared to 
the second aliquot is indicative that the agent is an inhibitor of said protein. 

The method of Claim 69 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from suitable control animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; and 

d) comparing the fatty acids of b) with the fatty acids of c); 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

A method for identifying an agent which is an inhibitor of a protein, said protein 
being encoded by a nucleic acid encoding a fatty acid transport protein 
comprising an amino acid sequence sharing at least about 95% amino acid 
sequence homology with the amino acid sequence in SEQ LD NO: 57, 
comprising the steps of: 

(a) introducing into host cells one or more vectors comprising a 
polynucleotide expressing said protein; 

(b) culturing a first aliquot of the host cells with fatty acid substrate of said 
protein and with an agent being tested as an inhibitor of said protein; 
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(c) culturing a second aliquot of the host cells with fatty acid substrate of 
said protein; 

(d) measuring, in the first and second aliquots, uptake of the fatty acid 
substrate of the host cells; 

wherein less uptake of the fatty acid substrate in the first aliquot compared to the 
second aliquot is indicative that the agent is an inhibitor of said protein. 

The method of Claim 71 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from suitable control animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; and 

d) comparing the fatty acids of b) with the fatty acids of c). 

whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

A method for identifying an agent which is an inhibitor of a fatty acid transport 
protein, comprising the steps of: 

(a) introducing into cells one or more vectors comprising a gene encoding a 
cell surface protein and a nucleic acid encoding the fatty acid transport 
protein; 

(b) contacting the host cells with anti-cell surface protein antibody and 
labeled fatty acid substrate of the fatty acid transport protein; 

(c) contacting a first aliquot of the host cells with an agent being tested as an 
inhibitor of the fatty acid transport protein, while leaving a second 
aliquot of the host cells uncontacted with the agent; 

(d) identifying, in the first and second aliquots, the host cells expressing the 
cell surface protein by detecting the anti-cell surface protein antibody 
bound to the host cells; and 
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(e) measuring ? in the first and second aliquots, uptake of the fatty acid 
substrate of the host cells identified as expressing the cell surface 
protein; 

wherein less uptake of the fatty acid substrate in the first aliquot compared to the 
5 second aliquot is indicative that the agent is an inhibitor of the fatty acid 

transport protein. 

74. The method of Claim 73 wherein the host cells regulably express the FATP4 
gene. 



75. The method of Claim 73 wherein the host cells are prokaryotes. 
10 76. The method of Claim 73 wherein the prokaryotes are E. colL 

77. The method of Claim 73 wherein the fatty acid is a radioactively labeled fatty 
acid. 

78. A method for identifying an agent which is an inhibitor of FATP4, comprising 
the steps of: 

1 5 (a) introducing into cells one or more vectors comprising a gene encoding a 

cell surface protein and a nucleic acid encoding FATP4; 

(b) contacting the host cells with anti-cell surface protein antibody and 
labeled fatty acid substrate of FATP4; 

(c) contacting a first aliquot of the host cells with an agent being tested as an 
2 0 inhibitor of FATP4, while leaving a second aliquot of the host cells 

uncontacted with the agent; 

(d) identifying, in the first and second aliquots, the host cells expressing the 
cell surface protein by detecting the anti-cell surface protein antibody 
bound to the host cells; and 
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(e) measuring, in the first and second aliquots, uptake of the fatty acid 
substrate of the host cells identified as expressing the cell surface 
protein; 

wherein less uptake of the fatty acid substrate in the first aliquot compared to the 
5 second aliquot is indicative that the agent is an inhibitor of FATP4. 

The method of Claim 78 further comprising the steps of: 

a) administering the agent to one or more test animals; 

b) measuring exogenously supplied fatty acids in one or more samples of 
tissue or bodily fluid from suitable control animals; 

c) measuring exogenously supplied fatty acids in one or more comparable 
samples of tissue or bodily fluid from suitable control animals; and 

d) comparing the fatty acids of b) with the fatty acids of c); 
whereby, lower fatty acids in step b) than in step c) is indicative that the agent is 
an inhibitor of said protein. 

1 5 80. The method of Claim 78 wherein the cell surface protein is CD2. 

81. The method of Claim 78 wherein the fatty acid substrate is BODIPY-labeled. 
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FATTY ACID TRANSPORT PROTEINS 

ABSTRACT OF THE DISCLOSURE 

A family of fatty acid transport proteins (FATPs) mediate transport of long chain 
fatty acids (LCFAs) across cell membranes into cells. These proteins exhibit different 
5 expression patterns among the organs of mammals. Nucleic acids encoding FATPs of 
this family, vectors comprising these nucleic acids, as well as the production of F ATP 
proteins in host cells are described. Also described are methods to test FATPs for fatty 
acid transport function, and methods to identify inhibitors or enhancers of transport 
function. The altering of LCFA uptake by administering to the mammal an inhibitor or 
1 0 enhancer of FATP transport function of a F ATP in the small intestine can decrease or 
increase calories available as fats, and can decrease or increase circulating fatty acids. 
The organ specificity of FATP distribution can be exploited in methods to direct drugs, 
diagnostic indicators and so forth to an organ such as the heart. 
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j^J3EESSESGCSI»^^ 50 

]R£ELRMOn^^ 100 

AimtenBisins^^ 150 

GZLSM^TOOr^YIFISC^ 250 

i^aau&megaiigE^^ 3Q0 

-•VEQH£EI£RmZ^^ 350 

TLEEMMMm^ 400 • 

KBEQGHaffil!^^ 450 

H/FElsTO^^ 500 

QESMTCGtflVPSIEH?^^ 550 

iTLKLQESLft!^^ 600 
i^SftllHSDESI 613 a 

mnEKIP4 M sequence 



aSIGS&GSOQGCTC^^ 80 

GGtXJIQCXIXMSia^ '120 

roamnomaG^ 160 

03OT3aaflCJaS3mX30^ 200 

G3ffiXIG33?mxm30GIQa 240 

iOCT33miinETCI^ 280 

O£TC?m30O20^ 320 

2sria2ACssH3^^ 360 

TKaasiacacaQami^^ 400 

GTMjnsxx&inxxxs^^ 440 

Gacax3Gonin3C!i^^ 480 . 

Q33£JJl!GG?mG&^ 520 

0333SGRflOT3^^ 560 

jm3\&3®Ca&2I^^ 600 , 

TCIGOOGCIMII^^ 640 

CBsa^r&rcrGsa^^ 720 



^Ti^UJuL loft 



.ro3cmm[Q3cirasrici^ iso 

CX3I2B33XIX3^^ 800 

ooaic&Hiasa^^ 840 

Gmo&oa^ 880 

GasmxaaraaGiCTGC^^ 920 

tochxjkiotoc^^ 1000 

ofiG&aariGC^^ 1040 

G3mCD3^ 1120 

aEGaasGooao^ 1160 - ■ • 

^jsoczoxraioz^^ 1200 

MQ3IG07IG£i23^^ 1240 

GGCIEGCOSnOT^^ 1280 

£j3CITIGC^mX^ 1320 

craxcmiiCTiCT 1360 

o^CAQSsmnc^s^^ 1400 

G&3332ITI£2m2£^^ 1440 

G3G30CnMmXm^^ 1520 

amriTCOxaacmra 1560 

sa^^jsacxiEms^^ 1600 

iamssoaaaQaaG^ 1640 

imGCiaaciaii^^ 1680 

GEGcnrriaxasccxzcca^^ 1720 

araiGC^jxroi^^ 1760 

cmiaajanscaGmoG^ 1800 

Q3QTOCaGQQQQCX3m3IHim33^^ 1840 

Tcmira^casms^ 1880 

A2%(Z3CICSEm3DC3^^ 1920 

GIQ333iQK3aX^^ 1960 

203H]C£iym5P^^ 2000 

G&aMG&GG?003ffl3IG3^^ 2040 

Gsj^srcaQsmsa^^ 2080 

ccctksticti^^ 2120 

T^GsozHramiini^^ 2160 

cmciGiramxs^^ 2200 

AS^SEGaos^iams^^ 2240 

MMMJ^M^JJmZ&ZA 2301 



mm&FP4 protein segoeice 



HaSfflft 9 3 «^^ AO 
S^ 1332 ^^^ 36Q 

Hi ■ ____ _ ' • "^isuju*. u 

m nrnEaiPS im sequence ■ 

* ^^^^^^ 

.350 

• ^™3in^^ 560 

Fig uajl \a ft 



^-^^ 880 

affiaoaamsar^^ 1000 

<3H33O^a03[3XE^^ 1080 

TIQ3 P [Z! ^^ 1120 

ciGiikrcraH^^ 

^ooacnnaaaaaftaan^ 13 20 

QC:mQa ^^ 1360 ' 

S Q3G 3C ^ 1440 " 

y 164-0 

Q/'^^ 3313 ^^ 1720 

53 T3rSm ' 1 ^^ 1800 

i is^o 

03 <3^33cm3Groaaac^ 1880 

GSa^GWOZIMS^^ 1920 

O™ 3 ^^ I960 

Q 3 * 3 ^^^ 2000 

SEQ03nSa33 ^^ 20*0 

TClra22 ^^ 2080 

TCaSe ^^ 2200' 

S™ 3 ^^^ 2240 



6 



IraR ^P5 protein sequence 



SSS^ SO 

^^^^ 200 

' ^E®r!P2 r®. sequence V ^ lxjul \^ 

■."^^^^ . 

^OGa^ on 

^^^^^^^^^ lf Q 
5™^^ 52 o 



AAaGG&aaiEeia^^ 840 

TX3£X3EX&2&Bnnn!C^^ 880 

AGKisoinramaa^ 920 

(3&GnX3CX3l£Sm&^ 960 

GsaasiasiMa^^ 1000 

03O3iroG&a^^ 1040 

caaa2a3m3®MiaxB^^ 1080 

anocnaii^^ 1120 

NiaasaHaam^ 1160 

azoaaac&a&Kiex^^ 1200 

caoaazfflBiGaa^^ 1280 . 

QS^MIICtaTra^^ 1360 

j\j203CTDS^22^^ 1400 

Gffl3y33W¥imX3a2GS^^ 1440 

£SOI[TITIft&^^ 1480 

GsasssmB&rmri?^^ .1520 

•' GEffiJTn^a^^ 1560 

QTITi&iSAA^ 1600 

, AaAaaaaftaftaaaaaazy^^ 1622 fr^u^x*- l ^ 

lisEAiES protein sequence 

Ym93IT3LEKa^^ 40 

80 

KKawTregHOT j Rm^NEFgcamsK^Riim^ 120 

RGDVfKQF^^ 160 

(^mjelpkecoifh^ 200 - 

\OII#CinD3^ 240 

M-DDIZMCHENinZE^^ 280 
GH/LFF 286 i< 

hs?3ffiP3 EN&. sequence 

C£ifl7ia333^3^^ 40 

A33IGaQa3iQ93G&aQ3^^ 80 

'isassKnainnOT^ 120 



Gsrasc^issDzri^^ 240 

ct^^jsctaig^^ 280 

G3XX338&33C8332C^^ 320 

TTIGS033MQ3a3CI^^ 360 

TraacaqcnE^^ 400 

213ICmG33Cm^^ 440 

TO33ffi3C£MA!It3^^ 480 

<3ra3C!IGim2EICIQ^ 520 

TOCOX!IOO«CIQC^^ 560 

2&BOZJnX2l3£®tt 600 

i^J3^^J£!iam3333IQ33^^ 640 

TOSQCIGIGkHasmjIO^ 680 

CEmrraiaaa^^ . 720 

TC3330330afta^^ 753 
hsE?ffiP3 protein sequence 

QH3EEe3IWEeLQVSQ2^ 40 

IDQSMSBHEZIGII^^ 8Q 

" ^aavBsviyKHSss^^ 120 

IPFE\KH?EI1^ESI^ 160 

EELTfUIDQ^Sa^^ 191 ^ ulAJL V 

hsE2ffiP4 im sequence 

TO^cnmai!rCT3^mm3^ so • 

££r?02&cez^G3m 120 

Gcaonxamao^ 160 

0332AQ3IG3CIG30?r^^ 200 

GmGCUI033C?^CTIC53Om3^^ 240 

TITC&ai^I33CMa^^ 280 

TimisoaiQD3«a3SQSi^^ 320 

G3aOSSm333]CTE3^^ 360 

G3GCH03m[G33C03^^ 400 
03CO3C?IDmffi23C^^ 

^^easmsy^osan^^ 480 

CEOZnaCT^JE^^ 520 



-EiOIDSBCTia^I^^ -560 

ctoaisiQxc&am 720 

<CT3T3CIQmEC 734 I^^UJ^ 2 -^^ 
hsEKTP4 protein sequence 

I(23jCmiI2^PH?E2^OT3^M^^ 40 

I&HIB2\&EF^3mi^^ SO; - - 

KDELERBEGSOGNQp^^ 160 

nsrr3rr;vwi ^u-a H'hiftArer^/iTr^ 200 

213 ^L^yjJU. ^ 
hsEKEP5 sequence 

amm^iiGCTiss^ 80 

(XOIASGITCIO^^^ 120 

C£G0MmS?3OGI^I^^ 160 

racoNEaraiG^^ 200 

SOOSa30GKrQDC2QX2^^ 240 

<2CE3Kn3£3I&2^^ 280 

AEITa3S*IC2N33^^ 320 

0333GSO3Q3O3C^^ 400 

TOcinicaxciT^ 440 

G3333aikninK3KX33^^ 480 

oaAQQC3^imicaD3iQ^^ 560 

GaGnxasGSflaasaaGcarate 600 

033smm^j3O33!naai^^ 640-' 

zcnsss&GQonzn^ 680 

0O3?ia33aTG3&^^ 720 

Tiwmropansa^^ soo 

G3IG3GCKIQ3C3T3C2^^ ■ 



TJnsasaSGBGt&QES^^ 880 

TO0333aGiaa3£^^ 920 

cG03ffl33a&33om^ 960 

OSSriQQimSIiH^^ 1000 

.afoaL'lt^UmGnSOQ^^ 1040 

<3D00C2Hm33KSaz^ 1080 

Axrosaaacrcna^^ 1120 

G3!Iiam33!G33m31!^^ 1160 

axrncMH^mOT^ 1200 

ADCor^ixnrs^^ 1240 

•ESXSGIGSCSS^^^ 1278 



hsS3S!P5 protein, sequence 



E33330GP^3^GE02aX^^ 40 

E&AEF\M£Q3KD3M3I2^^ 80 
KELSEFOTFMZE£^ 120 

<X0MW33E3W5IHEXmfcS^^ 160 

• GKWaiMSS&MQGPE^^ 199 

hsEaTP6 Em sequence 

OXSSBGISOmi^^ 80 

joamBsaGaa^ 120 

@S^JO^I5^i3Sm3a5^^ 160 

G3£AMG3C?mm3G^ 200 

io^iarnnsA^iiMft^^ 240 

^I33ajm3U03^ 280 

iOTIGS^AiST^^ 320 

TITCCS^HTniSO^ -360 

'IGMaXM£3£&&M^^ 400- 

juaaGSBGRCciQG^^^ 440 

ASMICCLLLailta^^^ 480 ■ 

2ES&&Ch!^ 520 

onosozns&E^^ 560 

i^j^maziTrim^ 600 

OGEIQaAaGSB^^ 640 

SSHIWIQ^iSnP^^ .680 

axnMGGiG]m3®2syr^^ 720 • 



TIG^iy^i3ITl2^^ 800 

cnTiMGCTGiaaoaam™ 840 

G^IO^OGG^OSOT^ 880 
toGaRGHBSffi^^ 920 

TTiaGnnfflHSfE^^ 960 
AAzra2T&aGK^ 1040 

CnSOSWOSGOEB^^ 1080 
jfflEG33&aGQG&^^ 1120 

TiTO^riaaaMG?^ 1160 

Tm^llLlTlUiA^^ 1200 
<ZTITr3mcnmTIG^^ 1240 • • • 

tidsotjeso^ 1280 

GoaGmGoaiaajroaciaw^^ 1320 

M&zmnEn&^^ 1360 
A 1361 ^rv^uJoL. 

iisE&EPS protein seaueoce 

ACMUKKBsasgsvimx 50 

IAJDSCEniSDWaREEUI^^ 100 

NLFYKIXSa^^ 150 

PtSSOPYKEg^ 200 

EWFOiNli!^^ 250 

SUZEKVXEQVVTEXJ^^ 300 

LKISEPLYIM^ 335 

intESJEP -TNa. sequence. 

' asGmaaasymKazoa^^ 40 

TI0ZD3£G3IIG3I^ -80 
.G^J^Sfl33CTIM^ 120 
OOO^mBISmSSa^^ 160 ■ 

cms^siimpGsoo^^ 200 

MK^I333CrD3CI^^ 240 
3!ia3GC2miEGT]IX2^ 280 

aossmiriaoi^^ 320 . 

O030^i33CrEmrCKmX3^ra 360' " 



TCKHSTIOTI^^^ 440 

' G33309GTCaa3IQ33Q^^ 480 

^i33£C&3Cm3^^ 520 

tfosiasaamaoo^ 56q 

C33393D3mDXnX3SOD^^ -540 

CCX2£C33Cn3J2GQ^ 680 

GsmrazmaAatsoaiKi^^ 720 

<3C3Caam33SH30X3^^ 760 

800 - 

CCGCI!3ISCC3!CM£3^^ 880 

0933CSm33GITO3ini3O^ 1000 

#rCIGCTC^J0C2m^^ 1040 

a»II3IGCH2GIGfi^^ 1080 

^3i333aaEaG^ 1120 

TCamssoncraaxDaD^^ H6o 

"IMC^l^ICI^^ 1200 

• CaSMGCOXSnQCE^^ 

AO^-^S^^OSO^^ 1320 

GOS^i^jGTIGSDS^^ 1400 

C 313 ™ 30 ^^ 1440 

iffl33;3a:a ^^ 1480 • 

TCoa:iG ^33^^ 1520 

^jscoxa^nscxs^^ 1560 

G ™ 33 ^ 1600 

(^xsccmszi!^ 1640 

OSSDCaOTGCTOSD^^ 1680 

Q™ 3 ™^^ 1720 

TO3D3aOi0 ^^ 17S0 

Q03:saoc ^^ 1800 

TZ&ZK^^ 1840 

jii ^™ai3^^ 1880 

Q3Q3 33^ 1320 

cisaaacanscaa^^ i960 

TICMcaQ ^^ 2000 . 

G03IT&G 2007 



latF&TB protein sequence 



gaiagmlii^^ 160 

cgasi^grvagcfelt^^ 200 

tsfyiftegttgf^^ 240 

satlysaLpl^^ 280 

fwdeviarimte^ 320 

cgnglxpaiwfe:^^ 360 

5 yprtagvspip^ 400 

2 glllsrwzrl^ 440 

O gdm&pcgjightt 480 

fi cfatveect^^^ 520 

□ rt^yghlpgy^ 560 





■ i 



hsFATPl 

1 teg acc cac ggc gtc egg gac ccc aaa gca gaa gec cgc aca gta ggc aca gcg cac cca 

6X aga agg gtc cag gag ccc gca gaa aca gaa agg ccc ccg gec cca gec ccc tag tec ccg 
121 ccc gec tec tgc ccg age ccc egg gag acc gaa ggc acg gec cgc age etc agg acg egg 

M R 

1S1 gec ccg ggt gcg ggc gcg gec teg gcg gec teg ccg gcg ccg teg egg ccg ctg ggg ccg 

APGAGAASVVSI.ALLWt.LQL 
241 ccg egg acc tgg age gcg gca gcg gcg etc ggc gtg tac gcg ggc age ggc ggc egg cgc 

PWTWSAAAALGVYVGSGGWR 
301 tec ccg cgc ate gec tgc aag acc gcg agg cga gac ccc ccc ggc ccc ccc gcg ccg acc 

FLRIVCKTARRDLFGLSVLI 
3 SI cgc gcg cgc jet g gag ccg egg egg cac cag cgc gec ggc cac acc ate ccg cgc ate ccc 

RVR 'LELRRHQRAGHTIPRIF 
421 cag gcg gta gcg cag cga cag ccc gag cgc ccg gcg ctg gtg gat gee ggg acc ggc gag 

QAVVQRQPERLALVDAGTGE 
481 cgc egg acc tec gcg cag ctg gac gec tac ccc aac gcg gta gec aac ccc etc cgc cag 

CWTFAQIiDAYSNAVANLFRQ 
541 ccg ggc etc gcg ccg gge gac geg gcg gee acc etc ccg gag ggc egg ccg gag ttc gcg 

LGFAPGDVVAXFLEGRPEFV 
601 ggg ctg egg ccg ggc ccg gec aag gcg ggc atg gag gee gcg ccg etc aac gcg aac ccg 

GLWLGLAKAGMEAALLNVHL 
661 egg cgc gag ccc ctg gec ccc tgc ctg ggc acc ccg ggc get aag gec ccg ate etc gga 

RREPLAFCLGTSCAKALXFG 
721 gga gaa atg gtg gcg gcg gtg gec gaa gcg age ggg cac ccg ggg aaa agt ctg acc aag 

GEMVAAVAEVSGH LGKSLIK 
781 etc tgc ccc gga gac ccg ggg ccc gag ggc acc ccg ccg gac acc cac etc ctg gac ccg 

FCSGDLGPEGIL PDTHLLDP 
841 ctg ctg aag gag gee tct acc gec ccc ctg gca cag ate ccc age aag ggc atg gac gac 

LLKEASTAPIiAQl PSKGMDD 
901 cgt cct ttc cac ate tac acg teg ggg acc acc ggg ctg ccc aag get gec att gee gtg 

RL-FYXYTSGTTGLPKAAIVV 
951 cac age agg cac tac cgc atg gca gec ttc ggc cac cac gee tac cgc atg cag gcg gec 

KSRYYRHAAFGHHAYRMQAA 
1021 gac gtg etc tat gac ege ctg ccc ctg tac cac teg gca gga aac ate ate ggc gcg ggg 

DVLYDCLPLYHSAGNIIGVG 
1081 cag tgc etc ate tat ggg ccg aca gtc gtc etc cgc aag aaa etc teg gee age ege etc 

QCL IYGLTVVLRKKFSASRF 
1141 tgg gac gac tgc ate aag cac aac tgc acg gcg gtt cag cac ate ggg gag ate Cgc cgc 

W.D DC IKYNC TVVQ Y I GEICR 
1201 cac ctg ccg aag cag ccg gcg cgc gag gcg gag agg cga cac ege gcg cgc ccg gcg gcg 

YLLKQPVREAERRHRVRLAV 
1261 ggg aac ggg ccg cgt ccc gee ate egg gag gag etc acg gag cgc etc ggc gca cgc caa 

GNGLRPAIWEEFTERFGVRQ 
1321 acc ggg gag ttc cac ggc gee acc gag tgc aac cgc age att gee aac atg gac ggc aag 

IGEFYGATBCNCSIANHDGK 
13 SI gtc ggc csc tgc ggc tec aac age cgc acc ctg ccc cac gcg tac ccc acc egg ctg gtg 

VGSCGFNSRILPKVYPIRLV 
1441 aag gtc aac gag gac aca atg gag ctg ctg egg gac gec cag ggc ccc cgc acc ccc cgc 

KVHEDTMELIiRDAQGLCXPC 
1S01 eag gec ggg gag ccc ggc etc ctt gcg ggt cag ate aac caa cag gac ccg ctg cgc cgc 

QAGEPGLLVGQINQQDPLRR 
1561 ttc gat ggc tat gtc age gag age gec acc age aag aag acc gee cac age gee ttc age 

FDGYVSESATSKKIAHSVF5 
1621 aag ggc gac age gec tac ccc Cca ggt gac gtg eta gtg atg gac gag ccg ggc tac atg 

KGDSAYLSGCVLVMDELGYM 
1681 tac ttc egg gac cgt age ggg gac acc etc cgc tgg cga ggg gag aac gee ccc acc acc 

YFRDRSGDTFRWRGEHVSTT 
1741 gag gcg gag ggc gtg ccg age cgc ccg ccg ggc cag aca gac gtg gec gtc tac ggg gtg 

EVEGVLSRLLGQTDVAVYGV 
1801 get gtc cca gga gtg gag ggc aag gca ggg atg gcg gec gtc gca gac ccc cac age ccg 

AV PGVEGKAGHAAVADFHSL 
1861 ccg gac ccc aae gcg aca tac cag gag ccg cag aag gtg ctg gca ccc tat gec egg ccc 

LDPNAXYQELQKVLAPYARF 
1921 ate ttc ccg cgc etc ctg ccc cag gcg gac acc aca ggc acc ttc aag acc cag aag acg 

IFLRLLPQVDTTGTFKIQKT 
19 91 agg ctg cag cga gag ggc ttc gac cca cgc cag acc cca gac egg etc ttc etc ctg gac 

RI*QREGFDPRQTSDRLFFLD 

2041 ccg aag cag ggc cac cac ccg ccc tea aac gag gca gec tac act cgc ate tgc teg ggc 

LKQGHYLPLMEAVYTRICSG 

2101 gec etc gec etc ega age tgt tec tct act ggc cac aaa ccc egg gec egg tgg gag agg 

A F A L * 

2161 cca get cga gee aga cag cgc cgc cca ggg gcg gec gee tag cac aca ccc acc tgg ccg 

2221 age tgt acc tgg eac ggc cca tec egg act gag aaa ctg gaa ccc cag agg aac ccg tgc 

2281 ccc tct get gee etg gtg ccc ctg tgt ctg cct ccc etc cct get ctt cag cct ctg ccc 

2341 ccc ccc ace cct gee cct gec egg cct taa ccc ttc ccc etc ttc cct ttc ttc ccc ccc 

2401 ttc ctt ctt etc aag ata gag tct cac tct get gec egg get aga geg cag tgg egg gat 

2461 etc ggc tea ctg caa cct ctg cct ccc ggg gee caa gtg acc etc cca cct cag ccc ccc 

2521 gag cag ctg gga tea cag gca ccc gec acc acg ccc age caa ctt tta eat ttc tag cag 

2581 aga egg ggc ttc acc atg ttg gec agg ccg gtc ctg aac ccc tga cct eag gcg ate cgc 

2641 tgg ccc egg cct ccc aga gtg ccg gga eta tag gcg tga gee tct ggc ccg gec etc cct 

2701 ttt tec tct cct etc ctg ccg aga gtg gaa cac acg cgt ccc ggg age ege ate teg tgc 

2761 agg gec cag ccg ccc ctg ggg ace gca gga acc acc ccc cct ggg ccc tgg acc egg acc 

2821 ggg gec tec cca cct ecc tec egg ctg tgc cct acg gag ccc caa tec agg cct ccc gcg 

2981 gec gtt ggg ttc cag atg ctg cag ccc cat gtg ace tec aag cag gee etc cgc ccc ccc 

2941 tgc cga atg gag gag ccg ggg gec ccc eag gec aac tgg aaa ate tee cag get agg cca 

3001 acc gec cct tgc acc tec ccg ttc ctg tea cac ttc ccc age cce acc etc ccc ccc tga 

3061 tgc cct gaa age etc egg aat tga ctg tga cca ctt gga tgt cac cac cgt cag ccc ctg 

3121 ccc cga tgt ccc cac tta gee ate ccc atg gag ccc ccg ccg gag ggc cct gaa cce tgc 

3131 acc gcg tgg ccg ccc age cag ctg ccc cct gtc ctg gga gga ggc etc ccg ggc gee ccc 

3241 ate tgg cgc gtc cac egg agg gtc cca cag gag agg cag cag agg ggt cag ggg agg tct 

3301 cct gec ggg ggc egg cct ccc aag cct cag ggg ttc cag cct get gaa eac acc cca cct 

3361 ggt ggg tgg ccc etc cga tgt ccc cac tga tgg etc cga cac cgc gtc ggt ggc gat gtc 

3421 cca gac aat ccc acc agg acg gec cag aca tec cca ctg get teg ctg gtg get cat etc 

3481 gaa cat cca cgc cag ecc etc egg ggc egg cca ccc agg ccg cct gtc cgc ctg tec ccc 

3541 etc eag cag cac ccc ctg gec cet gga gcg gcg ggg cca tgg caa gag aca ccg tgg cgt 

3601 etc acg cga acc tec ccg ggc acc gtg gee tta ttt cct aat tga etc aag aaa caa acc 

3661- tga aga ccg ccc ggt gaa aaa aaa aaa aaa aaa agg gcg gec gc 



Q { I h S? 



hsFATP4 

1 cga ccc acg cgt ccg ggc ggg egg ggc egg gcg gcg ggc ggg gee ggc ggg gcg gec ggg 

61 cca tgc agg gcg cag age egg eta aac cct get gag acc egg etc cgt gcg tec agg ggc 
121 ggc taa tgc ccc tea cgc tgt eta cgc tgc tgc aac egg gec gca tct gga egg ggc gec 
181 gcg egg egg age cga cgc egg gec aca atg ccg ctt gga gec tct ctg gtg ggg gtg ctg 

MLI*GASLVGVIi 
241 ctg ttc tec aag ctg gtg ctg aaa ctg ccc tgg acc cag gtg gga ttc tec ctg ttg etc 

OFSKLVtKLPWTQVGFSLLF 
301 etc tac ttg gga tct ggc ggc tgg cgc ttc ate egg gtc ttc ate aag acc ate agg cgc 

LYLGSGGWRFIRVFIKTIRR 
361 gat ate ttt ggc ggc ctg gtc etc ctg aag gtg aag gca aag gtg cga cag tgc ctg cag 

DIFfGGLVLIiKVKAKVRQCLQ 
421 gag egg egg aca gtg ccc att ttg ttt gec tct acc gtt egg cgc cac ccc gac aag acg 

ERRTVPILFASTVRRHPDKT 
491 gec ctg ate ttc gag ggc aca gat acc cac tgg acc ttc cgc cag ctg gat gag tac tea 

A L I FEGTDTHWT F RQLDEYS 
541 age agt gta gec aac ttc ctg cag gec egg ggc ctg gec teg ggc gat gtg get gec ate 

S SVANF&QARGL ASGDVAAI 
601 ttc atg gag aac cgc aat gag ttc gtg ggc eta tgg ctg ggc atg gec aag etc ggt gtg 

FMENRKBFVGLWLGMAKLGV 
661 gag gca gec etc ate aac acc aac ctg egg egg gat get ctg etc cac tgc etc ace acc 

EAALINTNLRRDALLHCLTT 
721 teg cgc gca egg gee ctt gtc ttt ggc age gaa atg gec tea gec ate tgt gag gtc cat 

SRARALVFGSEMAS.AICEVH 
781 gec age ctg gac ccc teg ccc age etc ttc tgc tct ggc tec tgg gag ccc ggt gcg gcg 

ASLDPSLSLFCSGSWEPGAV 
841 cct cca age aca gaa cac ctg gac cct ctg ctg aaa gat get ccc aag cac ctt ccc agt 

PPSTEHLDPLLKDAPKHLPS 
901 tgc cct gac aag ggc ttc aca gat aaa ctg ttc tac ate tac aca tec ggc acc aca ggg 

C PDKGFTDKLFY I YTSGTTG 
961 ctg ccc aag gec gec ate gtg gtg cac age agg tat tac cgc atg get gee ctg gtg tac 

LPKAAIVVHSRVVRMAALVY 
r== 1021 tat gga ttc cgc atg egg ccc aac gac ate gtc tat gac tgc etc ccc etc tac cac tea 

ygfrmrpjtdivydclplyks 

yjj 1081 gca gga aac ate gtg gga ate ggc cag tgc ctg ctg cat ggc atg acg gtg gtg att egg 

Jj« AGWIVGIGQCLLHGMTVVIR 

-j- 1141 aag aag ttc tea gec tec egg ttc tgg gac gat tgt ate aag tac aac tgc acg att gtg 
" KKFSASRFWDDCIKYNCTIV 

."Z 1201 cag tac att ggt gaa ctg tgc cgc tac etc ctg aac cag cca ccg egg gag gca gaa aac 
U| Q Y I GELCRYLLNQ PPREAEN 

2 r§ 1251 cag cac cag gct cgc at S gca eta ggc aat ggc etc egg cag tec ate tgg acc aac ttt 

' QHQ V F- MAL G'NG L R Q S IWTNF 

Q 1321 tec age cgc ttc cac ata ccc cag gtg get gag ttc tac ggg gee aca gag tgc aac tgt 
"J SSRFHIP-QVAEFYGATECNC 

s^s 1381 age ctg ggc aac ttc gac age cag gcg ggg gee tgt ggt ttc aat age cgc ate ctg tec 
„" SLGNFDSQVGACGFSfSRILS 

1441 ttc 9tg tac ccc atc c " ttg sta ^ gtc aac gag gac acc atg sag ctg ate egg ggg 
O " FVYPIRLVRVNEDTMELIRG 

~~ 1501 ccc gac ggc gtc tgc atc cct = to< = cag cca ggt gag ccg ggc cag ctg gtg ggc cgc atc 
y^J PDGVCIPCQPGEPGQLVGRX 
r§ = 1561 atc cag aaa gac ccc ctg cgc cgc ttc gat g 3C tac etc aac cag ggc gec aac aac aag 
=^ IQKDPLRRFDGYLNQGAMHK 
1.5 1 1621 ^3 act gcc ^3 ga£ gc c ttc aag aag ggg gac cag gec tac ctt act ggt gat gtg ctg 

KIAKDVFKKGDQAYLTGDVL 
igy 1681 gtg atg gac gag ctg ggc tac ctg tac ttc cga gac cgc act ggg gac acg ttc cgc tgg 

VMDELGYLYFRDRTGDTFRW 
T~r 1741 aaa ggt gag aac gtg tec acc acc gag gtg gaa ggc aca etc age cgc ctg ctg gac atg 
KGENVSTTEVEGTLSRLLDM 
1B01 get gac gtg gec gtg tat ggt gtc gag gtg cca gga acc gag ggc egg gee gga atg get 

ADVAVYGVEVPGTEGHAGMA 
1861 get gtg gec age ccc act ggc aac tgt gac ctg gag cgc ttt get cag gtc ctg gag aag 

AVAS PTGNCDLERFAQVI»EK 
1921 gaa ctg ccc ctg tat gcg cgc ccc atc ttc ctg cgc etc ctg cct gag ctg cac aaa aca 

SLPLYARPIPtaLLPELHKT 
1981 gga acc tac aag ttc cag aag aca gag eta egg aag gag ggc ttt gac ccg get att gtg 

GTYKFQKTELRKEGFDPAIV 
2041 aaa gac ccg ctg ttc tat eta gat gec cag aag ggc cgc tac gtc ccg ctg gac caa gag 

KDP EiFYLDAQKGRYVPtiDQE 
2101 gec tac age cgc atc cag gca ggc gag gag aag ctg tga ttc ccc cca tec etc tga ggg 

AYS RIQAGEEKL* 
2161 ccg gcg gat get gga tec gga gec cca ggt tec gec cca gag egg tec tgg aca agg cca 
2221 gae caa age aag cag ggc ctg gca cct cca tee tga ggt get gee cct cca tec aaa act 
2281 gec aag tga etc att gec ttc cea acc ctt cca gag get ttc tgt gaa agt etc atg tec 
2341 aag ttc cgt ctt ctg ggc tgg gca ggc cct ctg gtt ccc agg ctg aga ctg acg ggt ttt 
2401 etc agg atg atg tct tgg gtg agg gta ggg aga gga caa ggg gtc acc gag ccc ttc cca 
2461 gag age agg gag ctt ata aat gga acc aga gca gaa gtc ccc aga etc agg aag tea aca 
2521 gag tgg gca ggg aca gtg gta gca tec atc tgg tgg cca aag aga atc gta gec cca gag 
2581 ctg ccc aag ttc act ggg enc cac ccc cac etc cag gag ggg agg aga gga cct gac atc 
tgt agg tgg ccc ctg atg ccc cat eta cag cag gag gtc agg acc acg ccc ctg gec tct 
ccc cac tec ccc atc etc etc cct ggg tgg ctg cct gat tat ccc tea ggc agg gee tct 
cag tec ttg tgg gtc tgt gtc acc tec atc tea gtc ttg gec tgg eta tga ggg gag gag 
2821 gaa tgg gag agg ggg etc agg ggc caa taa act ctg cct tga gtc etc eta aaa aaa aaa 
2881 aaa aaa aaa aaa aaa aaa aaa aaa aaa ggg egg ccg c 



2641 
2-701 
2761 



47 



Protein sequence 



646 a. a. MRAfGAGAASW . VYTRICSGAF.M* 
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Protein sequence 



646 a. a. MRAJPGAGAASW ... VYTRICSGAFAL 



546 Amino Acids 



7 ID 62 Dal ton 



MW MW(%J 



A aLa 
C cys 
D asp 
E glu 
F phe 
G gly 
tt his 
I ila 
K lys 
L leu 
M met 
N asn 
P pro 
Q gin 
R arg 
S ser 
T thr 

V val 
W trp 
X ukw 

V tyr 
Z — 



alanine 

cysteine 

a spar tic acid 

glutamic acid 

phenylalanine 

glycine 

histidine 

isoleucine 

lysine 

leucine 

methionine 

asparagine 

proline 

giutamine 

arginine - 

serine 

threonine 

valine 

tryptophan 

unldiown 

tyrosine 

STOP 



64 
15 
30 
31 
29 
63 
13 
29 
22 
77 
11 
15 
29 
25 
49 
33 
27 
51 
9 

24 



9.9 
2.3 
4.6 
4.8 
4.5 
9,3 
2.0 
4,5 
3.4 
11.9 
1.7 
2.3 
4.5 
3.9 
7.6 
5.1 
4.2 
7.9 
1.4 

3.7 



4546 
1545 
3450 
4000 
4264 
3592 
1781 
3279 
2318 
B7G7 
1441 
1710 
2814 
3201 
7648 
2872 
2728 
5052 
1674 

3913 



6.4 
2.2 
4.9 
5.6 
6.0 
5-1 
2.5 
4.6 
4.0 

12.3 
2.0 
2.4 
4.0 
4.5 

10.8 
4.0 
3.8 
7.1 
2.4 

5.5 



.isFATP4.pep -> KD H^drophobicity <11/1> 

?rotein sequence 643 a. a. MLLGASLVGVLL ... AYSRIQAGEEKL 
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hsFATP4.pep -> A* A. Usage 

Protein sequence 643 a. a. MLLGASLVGVLL ... AYSRIQAGEEKL 





643 Amino Acids 


MW : 


72018 


Dal ton 








n 


n{%) 


MW 




A ala 


alanine 


46 


7.2 


3267 


4.5 


C cys 


cysteine 


16 


2.5 


1648 


2.3 


D asp 


aspartic acid 


33 


5.1 


3795 


5.3 


E glu 


glutamic acid 


33 


5.1 


4258 


5.9 


F phe 


phenylalanine 


34 


5.3 


5000 


6.9 


g gly 


glycine 


54 


8.4 


3079 


4.3 


H his 


histidine 


12 


1.9 


1644 


2.3 


r ile 


isoleucine 


30 


4.7 


3392 


4.7 


K lys 


lysine 


31 


4.8 


3970 


5.5 


L leu 


leucine 


76 


11.8 


8594 


11.9 


M met 


methionine 


12 


1.9 


1572 


2.2 


K asn 


asparagine 


21 


3.3 


2394 


3.3 


P pro 


proline 


31 


4.8 


3008 


4.2 


Q gin 


glut amine 


23 


3.6 


2945 


4.1 


R arg 


arginine 


45 


7.0 


7024 


9.8 


S ser 


serine 


' 35 


5.4 


3046 


4.2 


T thr 


threonine 


32 


5.0 


3233 


4.5 


V val 


valine 


46 


7.2 


4557 


6.3 


W trp 


tryptophan 


8 


1.2 


1488 


2.1 


X ukw 


unknown 










Y tyr 


tyrosine 


25 


3.9 


4076 


5.7 


z — 


STOP 











O 

m 
m 
a 

s 

m 
w 
■m 



in 



2 



CO 



in 

CM 



in 



in 



51 
51 



101 
101 



151 
151 




t genomic) 

G§pc|| hFATPlcon.seq ORF 

T^T^ mFATPl.seq ORF (fran genanic} 



C;^0ei^^f : ^g^A^^C C hFATPlcon.seq ORF 
T;^^^S^S&2MSSi€Slw G mFATPl.seq ORF (from genanic} 

" C |^S G 1S8 ^ATPlccti.seq GRF 
G$Sy A^TAl mFATPl.seq ORF (fran genanic) 



Tfe^^^: hFATPlcon.seq ORF 
Ci^jfifei mFATPl.seq ORF (fran genanic) 



£ G"G 
fC T-;G 



3 l$§|«gl§ A|i^ l^S^SSMS^Sm A|^i^^^^ Ail G T mFATPl.seq ORF (fran genomic) 



|TATA| 



Jg^P hFATPlccn.seq ORF 
i£G£St mFATPl.seq ORF (fran genanic} 




1151 
1151 

1201 
1201 

1251 
1251 

1301 
1301 



^G|^fe hFATPlccn.seq ORF 

^JKS mFATPl.seq ORF (fran genanic) 



C|pgj T hFATPlccn.seq CRF 
T?«ic mFATPl.seq ORF (fran genanic) 




1601 3£jes 

1601 Wfe? 

1651 SfiA^Gi 
1651 ffi^T^C 

1701 G^d^ 
1701 Afjsifli G 'T A;- 



hFA.TPlcai.seq ORF 
iriFATPl.seq ORF (frcm genomic] 

| hFATPlcon.seq ORF 
i; mFATPl.seq ORF (frcm genomic) 

hFATPlcon.seq ORF 
mFATPl.seq ORF (from genomic) 

hFATPlcon.seq ORF 
inFATPl . seq ORF < frcm genomic} 

hFATPlcon.seq ORF 
H TpFATPl.seq ORF (frcm genomic) 

Iff! G hFATPlcon. seq ORF 

A mFATPl.seq ORF (from genanic) 

A^rtftS AffgMg^S T^^S^S^ M ^mMmSM ^WM Tl rtEHTPl.seq ORF (frcm genanic) 



r?51 |p*p^, 
1751 S^-Cf 




(frcm genanic) 



Si ^|§pGAL_ 



%»9M§S ^ « C^;^^^^ C;§ hFATPlcaruseq ORF 



Ai& mFATPl.seq ORF (frcm genanic) 



ttb. c|||||lg: T;Ji c a%5 T-^^pp egg c t|@ g|^g|||^ c^eg| 



»1 G^MSl Cl T GfS cS#J|§£1 Till T G^ Afefe^ A§§^| t|| A* 



r- T G A 



G hFATPlcon.seq ORF 

A mFATPl.seq ORF (frcm genanic) 

hFATPlcon.seq ORF 
jmFATPl^seq ORF {frcm genanic) 



decoration 'Decoration #1': Shade (with solid bright yellow) residues that match the consensus named 'Consensus #1' exactly. 



Alignment Heport of FATP4 DNA alignment, using Clustal method wilh Weighted residue weight table. 

■HBMaSlflil 



hsEKTP4 
innFATP4 



A§|a rtmHKTP4 




Alignment Report of FATP4 DNA alignment, using Clustal method with Weighted residue weight tabJe. 



I860 WT .-,_ W _. 
I860 iSMfc^ 



Decoration 'Decoration #1* ; Shade (with solid bright yellow) residues that match the consensus named 'Consensus #1' exactly. 



hsEKTP4 
innEATP4 



Page 2 




Alignment Report of Untitled, using Clustal method with PAM25Q residue weight table. 



" Page 




hsEETPlpep 
nmEKFPlpep 

hsEMPPlpep 
nrnEATPlpep 

hsFATPlpep 
romEATPlpep 

hsFETPlpep 
imiEATPlpep 



^gpcoration 'Decoration #2': Shade (with solid bright yellow/) residues that match the consensus named ' Consensus #1' exactly. 



Alignment Report (rf FATP4 protain alignment, using ClustaJ method with PAM250 residua weighttable. 
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hsE£TP4pep 
iOTF?)TP4psp 



elation 'Decoration #1': Shade (with solid bright yellow) residues that natch the consensus named 'Consensus #1' exactly. 



^XS^XXKJL 33 



hsFATP6 

1 aac ggc aag taa gcg caa cgc aat taa tgt gag bag etc act cat tag gca ccc cag get 

61 tta cac ttt atg etc ccg ggc teg tat gtt gtg tgg aat tgt gag egg ata cca att tea 
121 cac agg aac cag eta tga cat gat tac gaa ttt aat acg act cac tat agg gaa ttt ggc 
181 cct cga ggc caa gaa ttc ggc acg agg ggt get gag ccc ctg cgc ggt ttc tgg tgc gta 
241 gag act gta aat cgc tgc get tct cag tea tea tea tec cag ctt ttc ccg get cga att 
301 cag cct cca act caa get cgc ggg aaa gac tac ctg aga gga gaa aag ctt ctg tec ctg 
361 gac ctt ctt ctg agg gtg gag teg gag get ccc tgc ttt cca gec gec cag tga ccc aag 
421 ctt aat ctt cag cac cac ttg ggg cga cct ttt egg tgc aaa cct acg att ctg ttt etc 
481 agg act cct ccc cat ccc get teg ccc egg aaa age tga caa gaa ctt cag gtg taa gec 
541 ctg agt agt^ag gat ctg egg tct ccg tgg aga get gtg cct gga aga gaa gga cgc tgg 
501 tgg ggg ctg 'aga tea gag ctg tct tct ggc cca gtt gec ccc atg ctt ctg tea tgg eta 

M L L S W L 

561 aca gtt eta ggg get gga atg gtc gtc ctg cac ttc ttg cag aaa etc ctg ttc cct tac 

TVLGAGHVVLHFLQKLLFPY 
721 ttt tgg gat gac ttc tgg ttc gtg ttg aag gtg gtg etc att ata att egg ctg aag aag 

FWDDFWFVLKVVLI IIRLKK 
781 tat gaa aag aga ggg gag ctg gtg act gtg ctg gat aaa ttc ttg agt cat gec aaa aga 

YEKRGELVTVLDKFLSHAKR 
841 caa cct egg aaa ccc ttc ate ate tat gag gga gac ate tac ace tat cag gat gta gac 

QPRKPFIIYEGDIYTYQDVD 
901 aaa agg age age aga gtg gec cat gtc ttc ctg aac cat tec tct ctg aaa aag ggg gac 

KRSSRVAHVFLNHSSLKKGD 
961 acg gtg get ctg ctg atg age aat gag ccg gac ttc gtt cac gtg tgg ttc ggc etc gee 

TVALIiMSNE PDFVHVWFGLA 
1021 aag ctg ggc tgc gtg gtg gec ttt etc aac acc aac att cgc tec aac tec etc ctg aat 

KLGCVVAFLHTNIRSNSIiLH 
1031 tgc ate cgc gee tgt ggg ccc aga gec eta gtg gtg ggc gca gat ttg ctt gga acg gta 

C IRACGPRALVVGADLLGTV 
1141 ■ gaa gaa ate ctt cca age etc tea gaa aat ate agt gtt tgg ggg atg aaa gat tct gtt 

EEILPSLSENISVWGMKDSV 
1201 cca caa ggt gta att tea etc aaa gaa aaa ctg age acc tea cct gat gag ccc gtg cca 
IsJ PQGVISLKEKLSTSPDEPVP 
.s». 1261 C< 3 C asc <=ac cat gtt gtc tea etc etc aag tct act tgt ctt tac att ttt acc tct gga 
"^f RSHHVVSLLKSTC LYIPTSG 

jjL. 1321 aca aca ggt eta cca aaa gca get gtg att agt cag ctg cag gtt tta agg ggt tct get 

TTGLPKAAVISQLQVLRGSA 

-»d 1381 gtc ctg tgg get ttt ggt tgt act get cat gac att gtt tat ata acc ctt cct ctg tat 
§jl VLWAFGCTAHBI vy itlply 

rJ. 1441 cac agt tca 9 ca 9Ct ate ctg gga att tct gga tgt gtt gag ttg ggt gec act tgt gtg 
U| H S SAA I L G I.SGCVELGATCV 

^-a, 1501 tta aag aag aaa ttt tca gca age cag ttt tgg agt gac tgc aag aag tat gat gtg act 
%s£ LKKKFSASQFWSDC KKYDVT 

j= 1561 gtg ttt cag tat att gga gaa ctt tgt cgc tac ctt tgc aaa caa tct aag aga gaa gga 

- VFQYIGELCRYLCKQSKREG 

£ 1621 gaa aag gat cat aag gtg cgt ttg gca att gga aat ggc ata egg agt gat gta tgg aga 
fs=* ' EKDHKVRLAIGNGIRSDVWR 

1—1 1681 gaa ttt tta gac aga ttt gga aat ata aag gtg tgt gaa ctt tat gca get acc gaa tca 
y~| EFLDRFGWIKVCELYAATES 

-™ 1741 age ata tct etc atg aac tac act ggg aga att gga gca att ggg aga aca aat ttg ttt 
I U sisfmnytgrigaigr'tnlf 

I - s 1801 taC aaa ctt Ctt tCC act Ctt gac tfca ata tat 9 ac ctt ca 9 3,33 9 ac 9 aa ccc at S 

UJ ykllstfdlikydfqkdepm 
.f, 1361 a sa aat gag cag ggt egg tgt att cac gtg aaa aaa gga gaa cct gga ctt etc att tct 

&NEQGWCIHVKKGEPGLLIS 
yQ 192 1 cga gtg aat gca aaa aat ccc ttc ttt ggc tat get ggg cct tat aag cac aca aaa gac 
RVNAKEJPFFGYAGPYKHTKD 
1981 aaa ttg ctt tgt gat gtt ttt aag aag gga gat gtt tac ctt aat act gga gac tta ata 

KLLCDVFKKGDVYLNTGDLI 
2041 gtc cag gat cag gac aat ttc ctt tat ttt tgg gac cgt act gga gac act ttc aga tgg 

VQDQDNFLYFWDRTGDTFRW 
2101 aaa gga gaa aat gtc gca acc act gag gtt get gat gtt att gga atg ttg gat ttc ata 
kgenvattevadv IGMLDFI 
cag gaa gca aac gtc tat ggt gtg get ata tca ggt tat gaa gga aga gca gga atg get 
QEAMVYG-VAISGYEGRAGMA 
tct att att tta aaa cca aat aca tct tta gat ttg gaa aaa gtt tat gaa caa gtt gta 
S IILKPKTSLDLEKVYEQVV 
aca ttt eta cca get tat get tgt cca cga ttt tta aga att cag gaa aaa atg gaa gca 
TFLPAYAC PRFLRIQE.KMEA 
aca gga aca ttc aaa eta ttg aag cat cag ttg gtg gaa gat gga ttt aat cca ctg aaa 
TGTFKLLKHQLVEDGFNPLK 
att tct gaa cca ctt tac ttc atg gat aac ttg aaa aag tct tat gtt eta ctg acc agg 
r SEPtYFMDNLKKSYVLLTR 
gaa ctt tat gat caa ata atg tta ggg gaa ata aaa ctt taa gat ttt tat ate tag aac 
EiYDQIHLGEIKL* 
2521 ttt cat atg ctt tct tag gaa gag tga gag ggg ggt ata tga ttc ttt acg aaa tgg gga 
2531 aag gga get aac att aat tat gca tgt act ata ttt cct taa tat gag aga taa ttt tct 
2641 aat tgc ata aga att tta att tct ttt aat tga tat aaa cat tag ttg att act ctt ttt 



2161 



2461 



2701 
2751 
2S21 

2881 gec gc 



ate tat ttg gag att cag tgc ata act aag tat ttt cct taa tac taa aga ttt taa ata 
ata aat agt ggc tag egg ttt gga caa tca eta aaa atg tac ttt eta ata agt aaa att 
tec aat ttt gaa taa aag att aaa ttt tac tga aaa aaa aaa aaa aaa aaa aaa ttg gcg 



Protein s equence 



619 a. a. HLLSWLTVLGAG ... LYEQXMLGEIKt, 




Pr&kSin 


SSCUGI1C3 619 


a, a 


KLLSWLTVLiG&G 


... LY 




619 Amino Acids 


m : 


70066 Dalton 








n 


n{%) 






ftekla 


alanine 


33 


5,3 


2344 


3.3 


tHys 


cysteine 


14 


2,3 


1442 


2.1 


tyisp 


aspartic acid 


34 


5.5 


3910 


5.5 


E^lu 


glutamic acid 


31 


5.0 


4000 


5.7 


i^he 


phenylalanine 


34 


5.5 


5000 


7.1 


Gfsly 


glycine 


44 


7.1 


2508 


3.6 


niis 


histidine 


13 


2.1 


1781 


2.5 


I ile 


isoleucine 


37 


6.0 


4184 


6.0 


K lys 


lysine 


43 


7.3 


6143 


s.e 


L leu 


leucine 


75 


12.1 


8481 


12.1 


M met 


methionine 


11 


1.8 


1441 


2.1 


N asn 


asparagine 


21 


3.4 


2394 


3.4 


P pro 


proline 


21 


3.4 


2038 


2,9 


Q gin 


glutamine 


18 


2.9 


2305 


3.3 


H arg 


arginine 


27 


4.4 


4214 


6,0 


S ser 


serine 


40 


6.5 


3481 


5.0 


T thr 


threonine 


30 


4.8 


3031 


4.3 


V vai 


valine 


51 


8-2 


5052 


7.2 


W ttp 


tryptophan 


11 


1.8 


2046 


2.9 


X ukw 


unknown 










Y tyr 


tyrosine 


26 


4.2 


4239 


6.1 


2 


STOP 











Alignment Report of hFATP1,4 t 6 Alignment, using CJustal method with PAM250 residue weight table. 
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1 L 

1 IHlswltvl GM G 




t^aration 'Decoration #1': Shade {with solid bright yellow) residues that natch the Consensus exactly 



hsFATPlpep 
hsEATP4psp 
hsEKTP6pep 




\ 



hsFATP4 


1 


mmFATP4 


1 






hsFATP4 


46 


mmFATP4 




1 JOi^rt lit 


48 


hsFATP4 


93 


mmFATP4 


93 


hcFATPI 


95 


hsFATP4 


140 


mmFATP4 


140 


hsFATPI 


142 


hsFATP4 


187 


mmFATP4 


187 


hsFATPI 


189 


hsFATP4 


233 


mmFATP4 


233 


hsFATPI 


236 


hsFATP4 


280 


mmFATP4 


280 


hsFATPI 


283 


hsFATP4 


327 


mmFATP4 


327 


hcPATPI 
flSrn I r 1 




hsFATP4 


374 


mmFATP4 


374 


he PA TP 1 


Of f 


hsFATP4 


421 


mmFATP4 


421 


hcFATPI 




hsFATP4 


468 


mmFATP4 


468 


hsFATPI 


471 


hsFATP4 


515 


mmFATP4 


515 


hsFATPI 


518 


hsFATP4 


562 


mmFATP4 


562 


hsFATPI 


565 


hsFATP4 


609 


mmFATP4 


609 


hsFATPI 


612 









- £/ -v;>" 
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• !i .1;? .vfi 



£ ir V \ 


7 o & 


a £ -L & K ' 




2 1 






■a sals ^. 








n mek n 



! ? k L"i in -IF il^ 'B ^jLMMM 




"3 



3? 



mmFATP4 DNA sequence 

ATGCTGCXTGGAGCCTCTCTGGTGGGC^CGCTACTGTTCTCCAAGCTAGTGCTGAAGCTGCCCTGGACCCAGGTGGGATT 
CTCCCTGTTGCTCCTGTACTTGGGGTCTGGTGGCTGGCGTTTCATCCGGGTCTTCATCAAGACGGTCAGGAGAGATATCT 

Sggtggcatggtgctcctgaaggtgaagaccaaggtgcgacggtaccttcaggagctoaa^ 

ggatgagtactccagtagtgtggccaacttcctgcaggcccggggcctggcctcaggcaatgtagttgccctctttatgg 
aaaaccgcaatgagtttgtgggtctgtggctaggcatggccaagctgggcgtggaggcggctctcatcaacaccaacctt 
aggcgggatgccctgcgccactgtcttgacacctcaaaggcacgagctctcatctttggcaotgag^^ 
ctgtgagatccatgctagcctggagcccacactcagcctcttctgctctggatcctgggagcccagcacagtgcccgtca 

GCA(^GAGCATCTGGACCCTCTTCTGGAAGATGCCCCGAAGCACCTGCCCAGTCACCCAGACAAGGOT 

ctcttctacatctacacatcgggcaccacggggctacccaaagctgccatxgtggtgcacagcaggtattatcgtatggc 

TTCCCTGGTGTACTATGGATTCCGCATGCGGCCTGATGACATTGTCTATGACTGCCTCCCCCTCTACCACTCAAGCAGGA 

aacatcgtggggattggcagtgcttactccacggcatgactgtggtgatccggaagaagttctcaggctcccggttctgg 

GATGATTGTATCAAGTACAACTGCACAGTGGTACAGTACATTGGCGAGCTCTGCCGCTACCTCCTGAACCA 

tgaggctgagtctcggcacaaggtgcgcatggcactgggcaacggtctccggcagtccatctggaccgacttctccagcg 

GTTTCCACATCCCCCAGGTGGCTGAGTTCTATGGGGCCACTGAATGCAACTGTAGCCTGGGCAACTTTGACAGGC^ 

ggggcctgtggcttcaatagccgcatcctgtcctttgtgtaccctatccgtttggtacgtgtcaatgaggataccatgga 

ACTGATCCGGGGACCCGATGGAGTCTGCATTCCCTGTCAACCAGGTCAGCCAGGCCAGCTGGTGGGTCGCATCATCCAGC 

AGGACCCTCTGCGCCGTTTCGACGGGTACCTCAACCAC^GTGCCAACAACAAGAAGATTGCTAATGATGTCTTCAAGAAG 

GGGGACCAAGCCTACCTCACTGGTGACGTCCTGGTGATGGATGAGCTGGGTTACCTGTACTTCCGAGATCGCACTG 

CACGTTCCGCTGGAAAGGGGAGAATGTATCTACCACTGAGGTGGAGGGCACACTCAGCCGCCTGCTTCATATGGCAGATG 

TGGCAGTTTATGGTGTTGAGGTGCCAGGAACTGAAGGCCGAGCAGGAATGGCTGCCGTTGCAAGTCCCATCAGCAACTGT 

GACCTGGAGAGCTTXGCACAGACCTTGAAAAAGGAGCTGCCTCTGTATGCCCGCCCCATCTTCCTGCGCrTCTTGCCTGA 

GCTGCACAAGACAGGGACCTTCAAGTTCCAGAAGACAGAGTTGCGGAAGGAGGGCTTTGACCCATCTGTTGTGAAAGACC 

CGCTGTTCTATCTGG^TGCTCGGAAGGGCTGCTACGTTGCACTGGACCAGGAGGCCTATACCCGCATCCAGGCAGGCGAG 

GAGAAGCTGTGATTTCCCCCTACATCCCTCTGAGGGCCAGAAGATGCTGGATTCAGAGCCCTAGCGTCCACCCCAGAGGG 

TCCTGGGCAATGCCAGACCAAAGCTAGGAGGGCCCGCACCTCCGCCCCTAGGTGCTGATCTCCCCTCTCCCAAACTGCCA 

AGTGACTCACTGCCGCTTCCCCGACCCTCCAGAGGCTTTCTGTGAAAGTCTCATCCAAGCTGTGTCTTCTGGTCCAGGCG 

TGGCCCCTGGCCCCAGGGTTTCTGATAGGCTCCTTTAGGATGGTATCTTGGGTCCAGCGGGCCAGGGTGTGGGAGAGGAG 

TCACTAAGiTCCCTCCAATCAGAAGGGAGCTTACAAAGGAACCAAGGCAAAGCCTGTAGACTCAGGAAGCTAAGTGGCCA 

GAGACTATAGTGGCCAGTCATCCCATGTCCACAGAGGATCTTGGTCCAGAGCTGCCAAAGTGTCACCTCTCCCTGCCTGC 



ACCTCTGGGGAAAAGAGGACAGCATGTGGCCACTGGGCACCTGTCTCAAGAAGTCAGGATCACACACTCAGTCCTTGTTT 
CTCCAGGTTCCCTTGTTCTTGTCTCGGGGAGGGAGGGACGAGTGTCCTGTCTGTCCTTCCTGCCTGTCTGTGAGTCTGTG 
TTGCTTCTCCATCTGTCCTAGCCTGAGTGTGGGTGGAACAGGCATGAGGAGAGTGTGGCTCAGGGGCCAATAAACTCTGC 
CTTGACTCCTCTTAAAAAAAAAAAAAAAAAAAJU^AAAAAAAJ^^ 




^3 A 



TnmFATP4 protein sequence 
MI^ASnVGALLFSK^K^^ 

ASMVQRHPDKTALIFEGTDTHWTFRQLDEYSSSyANFLQARGLASGN^ 
LFYIYTSGTTGLPKAAIVVHSRYYRMASL^^ 



EKL 



figure- ^3B> 



hsFATPI full lenght.DNA 



f 

10 20 30 40 

T j i i I t t t i 1 i « t ; I iiiti t rill i I 1 f 1 f ' T T ! 

TCGACCCACGGCGTCCGGGACCCCAAAGCAGAAGCCCGCA 40 
CAGTAGGCACAGCGCACCCAAGAAGGGTCCAGGAGTC !GC 80 
AGAAACAGAAAGGTCCCCGGCCTCAGCCTCCTAGTCCCTG 120 
CCTGCCTCCTGCCTGAGCTTCTGGGAGACTGAAGGCACGG 160 
CTTGCAGCTTCAGGATGCGGGCTCCGGGTGCGGGCGCGGC 200 

210 220 230 240 

, , . I I 1 . f ■ I ■ t I . 1 t I I ■ I < . ■ > I ■ T I I I 1 I 1 t I t 1 t I i 

CTCGGTGGTCTCGCTGGCGCTGTTGTGGCTGCTGGGGCTG 240 
CCGTGGACCTGGAGCGCGGCAGCGGCGCTCGGCGTGTACG 280 
TGGGCAGCGGCGGCTGGCGCTTCCTGCGCATCGTCTGCAA 320 
GACCGCGAGGCGAGACCTCTTCGGTCTCTCTGTGCTGATC 360 
CGCGTGCGCCTGGAGCTGCGGCGGCACCAGCGTGCCGGCC 400 

410 420 430 440 

, , ■ , I i . ■ ■ ! ■ ■ ■ ■ I * ■ i ■ I ' ' ■ ' ' ' t ' ' I ' ' ' ' I ' ' ' ' I 

ACACCATCCCGCGCATCTTTCAGGCGGTAGTGCAGCGACA 440 
GCCCGAGCGCCTGGCGCJGGTGGATGCCGGGACCGGCGAG 480 
TGCTGGACCTTTGCGCAGCTGGACGCCTACTCCAATGCGG 520 
TAGCCAACCTCTTCCGCCAGCTGGGCTTCGCGCCGGGCGA 560 
CGTGGTGGCCATCTTCCTGGAGGGCCGGCCGGAGTTCGTG 600 

610 620 630 640 

■ ■ ■ . i ■ ■ ■ ■ i ■ ■ ■ ■ i > • t . I i ■ i i f i i > i I i i i i I ' ' ' ' I 

GGGCTGTGGCTGGGCCTGGCCAAGGCGGGCATGGAGGCCG 640 
CGCTGCTCAACGTGAACCTGCGGCGCGAGCCCCTGGCCTT 680 
CTGCCTGGGCACCTCGGGCGCTAAGGCCCTGATCTTTGGA 720 
GGAGAAATGGTGGCGGCGGTGGCCGAAGTGAGCGGGCATC /60 
TGGGGAAAAGTTTGATCAAGTTCTGCTCTGGAGAC I TGGG 800 

810 820 830 840 

, ■ ■ i I i I I I i I i I t I t I I ■ I ' t III I I I I I 1 I I 1 I, I I T I 1 

GCCCGAGGGCATCTTGCCGGACACCCACCTCCTGGACCCG 840 
CTGCTGAAGGAGGCCTCTACTGCCCCCTTGGCACAGATCC 880 
CCAGCAAGGGCATGGACGATCGTCTTTTCTACATCTACAC 920 
GTCGGGGACCACCGGGCTGCCCAAGGCTGCCATTGTCGTG 960 
CACAGCAGGTACTACCGCATGGCAGCCTTCGGCCACCACG 1000 

1010 1020 1030 1040 

; ■ i . I t i i ■ 1 i i I i I i i I I I I I I I t I 1 1 t I I I I I 1 I ! 1 1 I 

CCTACCGCATGCAGGCGGCTGACGTGCTCTATGACTGCCT 1040 
GCCCCTGTACCACTCGGCAGGAAACATCATCGGCGTGGGG 1080 
CAGTGTCTCATCTATGGGCTGACAGTCGTCCTCCGCAAGA 1 1 20 
AATTCTCGGCCAGCCGCTTCTGGGACGACTGCATCAAGTA 1 160 
CAACTGCACGGTGGTTCAGTACATCGGGGAGATCTGCCGC 1200 



hsFATPI full lenght.DNA 



1210 1220 1230 1240 

! t T t 1 f > t t \ i t t , I i t f ; ! i i t t 1 » t i i 1 ? f t I I ! I 1 f ( 

TACCTGCTGAAGCAGCCGGTGCGCGAGGCGGAGAGGCGAC 1240 
ACCGCGTGCGCCTGGCGGTGGGGAACGGGCTGCGTCCTGC 1280 
CATCTGGGAGGAGTTCACGGAGCGCTTCGGCGTACGCCAA 1320 
ATCGGGGAGTTCTACGGCGCCACCGAGTGCAACTGCAGCA 1360 
TTGCCAACATGGACGGCAAGGTCGGCTCCTGTGGTTTCAA 1 400 

1410 1420 1430 1440 

. . . i i ■ ■ i . I i < ■ ■ i ■ ' ■ . I i ' • ■ i ' ' ' ■ I ■ ■ ■ ' I ' ' ■ ' I 

CAGCCGCATCCTGCCCCACGTGTACCCC ATCCGGCTGGTG 1440 
AAGGTCAATGAGGACACAATGGAGCTGCTGCGGGATGCCC 1480 
AGGGCCTCTGCATCCCCTGCCAGGCCGGGGAGCCTGGCCT 1520 
CCTTGTGGGTCAGATCAACCAACAGGACCCGCTGCGCCGC 1560 
TTCGATGGCTATGTCAGCGAGAGCGCCACCAGCAAGAAGA 1600 

1610 1620 1630 1640 

t I 1 I I I « I T ! t I T 1 E I ... I I I T ! 1 . 1 T ■ I 1 ' t T ! I t I F I 

TCGCCCACAGCGTCTTCAGCAAGGGCGACAGCGCCTACCT 1640 

CTCAGGTGACGTGCTAGTGATGGATGAGCTGGGC i ACATG 1680 

TACTTCCGGGACCGTAGCGGGGACACC TTCCGCTGGCGAG 1720 

GGGAGAACGTCTCCACCACCGAGGTGGAGGGCGTGCTGAG 1760 

CCGCCTGCTGGGCCAGACAGACGTGGCCGTCTATGGGG I G 1800 

1810 1820 1830 1840 

. ■ i . i » t . i I i i i , i . . ■ i I i . i i i i i i i I < i i i l i i t \ 1 

GCTGTTCCAGGAGTGGAGGGTAAGGCAGGGATGGCGGCCG 1840 
TCGCAGACCCCCACAGCCTGCTG-GACCCCAACGCGATATA 1880 
CCAGGAGCTGCAGAAGGTGCTGGCACCCTATGCCCGGCCC 1920 
ATCTTCCTGCGCCTCCTGCCCCAGGTGGACACCACAGG'CA 1960 
CCTTCAAGATCCAGAAGACGAGGCTGCAGCGAGAGGGCTT 2000 

2010 2020 2030 2040 

t . > ■ 1 » t i i I * t t i 1 i * r i I t i i r ! r t f i | T I T t I ' I T 1 j 

TGACCCACGCCAGACCTCAGACCGGCTCTTCTTCCTGGAC 2040 
CTGAAGCAGGGCCACTACCTGCCCTTAAATGAGGCAGTCT 2080 
ACACTCGCATCTGCTCGGGCGCCTTCGCCCTCTGAAGCTG 2120 
TTCCTCTACTGGCCACAAACTCTGGGCCTGGTGGGAGAGG 2160 
CCAGCTTGAGCCAGACAGCGCTGCCCAGGGGTGGCCGCC I 2200 

2210 2220 2230 2240 

■ ■ ■ ' I < • < > I t ... I ... f I I I ■ ! I I I I I I I I I ' I I I I I , I 

AGTACACACCCACCTGGCCGAGCTGTACCTGGCACGGCCC 2240 
ATCCTGGACTGAGAAACTGGAACCTCAGAGGAACCCGTGC 2280 
CTCTCTGCTGCCTTGGTGCCCCTGTGTCTGCCTCCTCTCC 2320 
CTGCTTTTCAGCCTCTGTCTCCTTCCATCCCTG : CCCTGT 2360 
CTGGCCTTAACTCTTCCCTCTCTTTCTTTTCTTTCTTTC I 2400 

2410 2420 ■ 2430 2440 

. ■ . . i .... I ... . i . ... l . . . i i . . . . I i . . . ) i i i i I 

TTCTTTTTTTTTAAGATAGAGTCTC ACTCTGCTGCCCGGG 2440 
CTAGAGTGCAGTGGTGGGATCTCGGCTCACTGCAACCTCT 2480 
GCCTCCTGGGGTTCAAGTGATCCTCCCACCTCAGCCTCCT 2520 
GAGTAGCTGGGATTACAGGCACCCGCCACCACGTCCAGCT 2560 
AATTT'TTATATTTTTAGTAGAGACGGGGTTTCACCATGTT 2600 




hsFATPI full lenght.DNA 



2610 2620 2630 2640 

GGTCAGGCTGGTCTTGAACTCCTGACCTCAGGTGATCCGC 2640 
TGGCCTCGGCCTCCCAGAGTGCTGGGATTATAGGCGTGAG 2680 
CCTCTGGCCCGGCCTTTCCTTTTTCCTCTCCTCTCCTGCC 2720 
GAGAGTGGAACACACGTGTCCTGGGAGCTGCATCTTGTGT 2760 
AGGGTCCAGCTGCTTTTGGGGACTGCAGGAATCATCTCCC 2800 

2810 2820 2830 2840 

* * > I I I 1 t I I I t T t I 1 1 t T I ! I t 1 1 t t 1 t i [ t [ f \ 1 1 1 | 1 

CTGGGCCCTGGACTCGGACTGGGGCCTCCCCACCTCCCTC 2840 
TCGGCTGTGCCTTACGGAGCCCCAATCCAGGCCTCCTGTG 2880 
GCTGTTGGGTTCCAGATGCTGCAGCTCCATGTGACTTCCA 2920 
AGCAGGCCCTCCGCCCTCCCTGCTGAATGGAGGAGCCGGG 2960 
GGTCCCCCAGGCCAACTGGAAAATCTCCCAGGCTAGGCC A 3000 

3010 3020 3030 3040 

•*^*tT itt^j TTTrl t t x t ! r t i i T t t i i I « t t ) t i t t t 1 

ATTGCCTTTTGCACTTCCCCGTTCCTGTCACATTTCCCCA 3040 
GCCCCACCTTCCCCTCCTGATGCCCTGAAAGCTTCCGGAA 3080 
TTGACTGTGACCACTTGGATGTCACCACTGTCAGCCCCTG 3120 
CCTTGATGTCCCCATTTAGCCATCTCCATGGAGCTCCTGC 3160 
TGGAGGGCCCTGAACCCTGCACTGCGTGGCTGCCCAGCCA 3200 

3210 3220 3230 3240 

_' ' * * I ''Til f 1 i T I T t r r \ t T 1 t I f T t I 1 f r I T \ I t t t t 

GCTGCCTCCTGTCCTGGGAGGAGGCCTCCTGGGTGTCCTC 3240 
ATCTGGTGTGTCTACTGGAGGGTCCCACAGGAGAGGCAGC 3280 
AGAGGGGTCAGGGGAGGTCTCCTGCCGGGGGTTGGCCTCT 3320 
CAAGCCTCAGGGGTTCTAGCCTGTTGAATATACCCCACCT 3360 
G GTGGG TGGC CC C TC C G ATGTCCCC AC TGATGGCTCTG AC 3400 

3410 3420 3430 3440 

. ' I * t 1 I 1 I I t t ! I t f I T I t f I » T ! I I I * 1 I t t I I 1 t I f f f 

ACCGTGTTGGTGGCGATGTCCCAGACAATCCCACCAGGAC 3440 
GGCCCAGACATCCCTACTGGCTTCGCTGGTGGCTCATCTC 3480 
GAACATCCACGCCAGCCTTTCTGGGGCCGGCCACCCAGGC 3520 
CGCCTGTCCGTCTGTCCTCCCTCCAGCAGCACCCCCTGGC 3560 
CCCTGGAGTGGTGGGGCCATGGCAAGAGACACCGTGGCGT 3600 

3610 3620 3630 3640 

I I f t f 1 I f t I t 1 t r I I I 1 1 I t I I T f I ) j 1 , i I t t t 1 T 1 t t f 

C TC ATGTG AAC T TT C C TGGGC AC TGTGGTTTTATTTCCTA 3640 
ATTGATTTAAGAAATAAACCTGAAGACCGTCTGGTGAAAA 3680 
AAAAAAAAAAAAAA 3694 



hsFATPI full lertght.protein 



10 20 30 40 

t f t t I * i f , , t , , j , f , f y ? 1 i y i i 1 i , t, i % , 1 t 1 * ^ t i i i I t i i i 1 

MRAPGAGAASVVSLALLWLLGLPWTWSAAAALGVYVGSGG 40 
WRFLRI VCKTARRDLFGLSVL I RVRLELRRHQRAGHT I PR 80 
IFQAVVQRQPERLALVDAGTGECWTFAQLDAYSNAVANLF 120 
RQLGFAPGDVVAIFLEGRPEFVGLWLGLAKAGMEAALLNV 160 
NLRREPLAFCLGTSGAKAL I FGGEMVAAVAEVSGHLGKSL 200 

210 220 230 240 

* ■ ■ ■ i ' ■ < ' I * ■ ■ ■ ' ' ■ ■ ■ I ■ ' ' ' i ' ■ ■ ' I ' ' ' ■ i ' ' ' ' I 

IKFCSGDLGPEG ILPDTHLLDPLLKEASTAPLAQ IPSKGM 240 
DDRLFY I YTSGTTGLPKAA I VVHSRYYRMAAFGHHAYRMQ 280 
AADVLYDCLPLYHSAGN I I GVGQCL I YGLTV VLRKKFSAS 320 
RFWODC IKYNCTVVQYI GEI CRYLLKQPVREAERRHRVRL 1 360 
AVGNGLRPAIWEEFTERFGVRQ I GEFYGATECNCS I ANMD 400 

410 420 430 440 

t t i i f i t i i f t i i t ? lift! i t r t I f t t r I t , r r I t j i. % \ 

GKVGSCGFNSR ILPHVYP IRLVKVNEDTMELLRDAQGLC I 440 
PCQAGEPGLLVGQ I NQQDPLRRFDGYVSESATSKK I AHSV 480 
FSKGDSAYLSGOVLVMOELGYMYFRDRSGDTFRWRGENVS 520 
TTEVEGVLSRLLGQTDVAVYGVAVPGVEGKAGMAAVADPH 560 
SLLDPNA I YQELQKVLAPYAKP I FLRLLPGVDTTGTFK I Q 600 

610 620 630 640 

i i ' i t ' t ' i T i ' ' ' f i » * » t i i ■ » * t t r i f f f f f I * t I t 1 

KTRLQREGFDPRQTSORLFFLDLKQGHYLPLNEAVYTRIC 640 
SGAFAL. 647 



hsVLACS full lenght.DNA 



/ 

10 20 30 no 

I I ' I I I I I I I I t t r I I I t t I I I t i I » i ■ ■ I i r ■ i I i t i i I 

GGAATTCCAAAAAAAAAAAATACGACTACACCTGCTCCGG 40 
AGCCCGCGGCGGTACCTGCAGCGGAGGAGCTCTGTCTTCC 80 
CCTTCATCTCACGCGAGCCCGGCGTCCCGCCGCGTGCGCC 120 
CCGGCGCAGCCCGCCAGTCCGCCCGGAGCCCGCCCAGTCG 160 
CCGCGCTGCACGCCCGGGGTGAACCCTCTGCCCTCGCTGG 200 

210 220 230 240 
1 ■ 1 1 i ■ 1 ■ 1 * ' 1 ■ ' 1 ' ■ ■ ' I ' t ■ i i i ' ' ■ I > i « ■ i • < ■ ■ i 

GACAGAGGGCCCCGCAGCCGTCATGCTTTCCGCCATCTAC 240 
ACAGTCCTGGCGGGACTGCTGTTCCTGCCGCTCCTGGTGA 280 
ACCTCTGCTGCCCATACTTCTTCCAGGACATAGGCTACTT 320 
CTTGAAGGTGGCCGCCGTGGGCCGGAGGGTGCGCAGCTAC 360 
GGGCAGCGGCGGCCGGCGCGCACCATCCTGCGGGCGTTCC 400 

410 420 430 440 

■ ■ ' ■ I ' ' ■ ' 1 ' ' = ' ' ' ■ < ■ 1 ' ■ ' ■ i ■ ' ■ ' I < ■ ■ ' ' i ■ ■ ■ i 

TGGAGAAAGCGCGCCAGACGCCACACAAGCCTTTTCTGCT 440 
CTTCCGCGACGAGACTCTCACCTACGCGCAGGTGGACCGG 480 
CGCAGCAATCAAGTGGCCCGGGCGCTGCACGACCACCTCG 520 
GCCTGCGCCAGGGAGACTGCGTGGCGCTCCTTATGGGIAA 560 
CGAGCCGGCCTACGTGTGGCTGTGGCTGGGGCTGGTGAAG 600 

610 620 630 640 

' 1 1 1 1 ' ' ' 1 1 1 1 1 ' 1 1 ' ' ' 1 ' 1 ' ' I ' ' ' ' I ' ■ t ! I , . , . I 

C TGGG C TG TGC CATGGCGTGCCTCAATTAC A AC ATCCGCG 640 
CGAAGTCCCTGCTGCACTGCTTCCAGTGCTGCGGGGCGAA 680 
GGTGCTGCTGGTGTCGCCAGAACTACAAGCAGCTGTCGAA 720 
GAGATACTGCCAAGCCTTAAAAAAGATGATGTGTCCATCT 760 
ATTATGTGAGCAGAACTTCTAACACAGATGGGATTGACTC 800 

810 820 830 840 

■ ■ ' ■ 1 ■ 1 » 1 I < ' ■ ' ' ' ■ ' ' 1 ' ■ ■ ■ i ' ■ ■ ■ I ■ ' ■ t i . i i i l 

TTTCCTGGACAAAGTGGATGAAGTATCAACTGAACCTATC 840 
CCAGAGTCATGGAGGTCTGAAGTCACTTTTTCCACTCCTG '880 
CCTi ATACAT I TATAC TTCTGGAACCACAGGTCTTCCAAA 920 
AGCAGCCATGATCACTCATCAGCGCATATGGTATGGAACT 960 
GGCCTCACTTTTGTAAGCGGATTGAAGGCAGATGATGTCA 1000 
1010 1020 1030 1040 

' ' ' ' I ' ' ' ' I ' ' ' ' * ' ■ ' ' I ■ ' ' ' ' ' ' ' ' 1 ' I t I I t I . r I 

TCTATATCACTCTGCCCTTTTACCACAGTGCTGCACTACT 1040 
GATTGGCATTCACGGATGTATTGTGGCTGGTGCTACTCTT 1080 
GCCTTGCGGACTAAATTTTCAGCCAGCCAGTTTTGGGATG 1 120 
ACTGCA'GAAAATACAACGTCACTGTCATTCAGTATATCGG 1 160 
TGAACTGCTTCGGTATTTATGCAACTCACCACAGAAACCA 1200 



hsVLACS full lenght.DNA 



1210 1220 1230 1240 

f ' t t 1 r i i i 1 r i r i 1 i ; i I \ i i 1 I I I f ! « ! I t T * I T 1 t 1 f _ 

AATGACCGTGATCATAAAGTGAGACTGGCACTGGGAAATG 1240 
GCTTACGAGGAGATGTGTGGAGACAATTTGTCAAGAGATT 1280 
TGGGGACATATGCATCTATGAGTTCTATGCTGCCACTGAA 1320 
GGCA'ATATTGGATTTATGAATTATGCGAGAAAAGTTGGTG 1360 
CTGTTGGAAGAGTAAACTACCTACAGAAAAAAATCATAAC 1 400 

1410 1420 1430 1440 

J I I t f 1 f f t 1 I f t t f t t t 1 1 1 t f t I lift! t I t T f 1 t 1 T 1 

TTATGACCTGATTAAATATGATGTGGAGAAAGATGAACCT 1440 
GTCCGAGATGAAAATGGATATTGCGTCAGAGTTCCCAAAG 1480 
GTGAAGTTGGACTTCTGGTTTGCAAAATCACACAACTTAC 1520 
ACCATTTAATGGCTATGCTGGAGCAAAGGCTCAGACAGAG 1560 
AAGAAAAAACTGAGAGATGTCTTTAAGAAAGGAGACCTCT 1600 

1610 1520 1630 1640 

| | i f 1 t r < i ! t l t i ! i i t t 1 t i 1 t 1 t t l t I l r l 1 F J t I I 1 

ATTTCAACAGTGGAGATCTCTTAATGGTTGACCATGAAAA 1640 

TTTCATCTATTTCCACGACAGAGTTGGAGATACATTCCGG 1680 

TGGAAAGGGGAAAATGTGGCC ACCACTGAAGTTGCTGA I A 1720 

CAGTTGGACTGGTTGATTTTGTCCAAGAAGTAAATGTTTA 1760 

TGGAGTGCATGTGCCAGATCATGAGGGTCGCATTGGCATG 1800 

1810 1820 1830 1840 

■ i i i i ■ ■ i ■ I ■ t . t i i . . i l i i i i i i i t i I i i • i i i i i i I 

GCCTCCATCAAAATG'AAAGAAAACCATGAATTTGATGGAA 1840 
AGAAACTCTTTCAGCACATTGCTGATTACCTACCTAGT l A 1880 
TGCAAGGCCCCGGTTTC TAAGAATACAGGACACCATTGAG 1920 
ATCACTGGAACTTTTAAACACCGCAAAATGACCCTGGTGG 1960 
AGGAGGGCTTTAACCCTGCTGTCATCAAAGATGCCTTGTA 2000 

2010 2020 2030 2040 

. ■ i i i i ■ ■ t I ■ t ■ ■ i ' ■ ' ' I ■ ' ■ ■ 1 ' ■ ■ ' I ' ' ■ ' 1 ' ' ' ' I 

TTTCTTGGATGACACAGCAAAAATGTATGTGCCTATGACT 2040 
GAGGACATCTATAATGCCATAAGTGCTAAAACCCTGAAAC 2080 
TCTGAATATTCCCAGGAGGATAACTCAACATTTCCAGAAA 2120 
GAAACTGAATGGACAGCCACTTGATATAATCCAACTT i A A 2160 
TTTGATTGAAGATTGTGAGGAAATTTTGTAGGAAATTTGC 2200 

2210 2220 2230 2240 

t i i f f f'f t t I t f I 1 I r t i r 1 t 1 r < I t a t t \ t i t t < i i t t 1 

ATACCCGTAAAGGGAGACTTTTTTAAATAACAGTTGAGTC 2240 
TTTGCAAGTAAAAAGATTTAGAGATTATT ATTTTTCAGTG 2280 
TGCACCTACTGTTTGTATTTGC AAACTGAGCTTGTTGGAG 2320 
GGAAGGCATTATTTTTTAAAATACTTAGTAAATTAAATGA 2360 
AC 2362 



hsVLACS full ienght.protein 



10 20 30 40 

* * ' 1 1 I 1 1 T I 1 > I I T I I f T j T t I I 1 f 1 I I ! 1 1 1 1 1 I 1 t 1 I 

MLSAI YTVLAGLLFLPLLVNLCCPYFFQD I GYFLKVAAVG 40 
RRVRSYGQRRPART I LRAFLEKARQTPHKPFLLFRDETLT 80 
■ YAQVDRRSNGVARALHQHLGLRQGDCVALLMGNEPAYVWL 120 
i WLGLVKLGCAMACLNYN I RAKSLLHCFQCCGAKVLLVSPE 160 
I LQAAVEEILPSLKKDDVS I YYVSRTSNTDG IQSFLDKVDE 200 

l 210 220 230 240 

'f 1 I t I I < f 1 I > » >■ * I I I I I I T t t I I > T I I I 1 1 1 I 1 I I t I 1 , 

i VSTEP IPESWRSEVTFST.PALY I YTSGTTGLPKAAM I THQ 240 

! R IWYGTGLTFVSGLKADDV I Y ITLPFYHSAALL I G I HGCI 280 

I VAGATLALRTKFSASQFWDDCRKYNVTV I QY IGELLRYLC 320 

j NSPQKPNDRDHKVRLAtGNGLRGDVWRQFVKRFGDI C I YE 360 

j FYAATEGNI GFMNYARKVGAVGRVNYLQKK I ITYDLIKYD 400 

j 410 420 430 440 

' r 1 J 1 I t T f t T I t I f t J t } t t t t } 1 J * I ) ) t t t ) T 1 T I I 

VEKDEPVRDENGYCVRVPKGEVGLLVCK I TQLTPFNGYAG 440 
AKAQTEKKKLRDVFKKGQLYFNSGDLLMVDHENF I YFHDR 480 
VGDTFRWKGENVATTEVADTVGLVDFVQEVNVYGVHVPDH 520 
EGR I GMAS I KMKENHEFDGKKLFQHI ADYLPSYARPRFLR 560 
I QDT IEI TGTFKHRKMTLVEEGFNPAV I KDALYFLDOTAK 600 

610 620 630 640 

« t i r f i i i I 1 i i t i 1 i i t i t t i t i f t t r t I i i t t f i I t i I 

. MYVPMTED I YNA I SAKTLKL. 621 



hsFATP3 partial. DN A 



10 20 30 40 

! ■ ■ i I ■ r ■ ■ I ■ ' ■ ' ' ■ ' ' ■ 1 ' 1 ' ' 1 1 ' ' ' I 1 ' ' ' 1 1 ' ' ' I 

AAGTTCTCGGCTGGTCAGTTCTGGGAAGATTGCCAGCAGC 40 
ACAGGGTGACGGTGTTCCAGTACATTGGGGAGCTGTGCCG 80 
ATACCTTGTC AACC AGCCCCCGAGCAAGGCAGAACGTGGC 120 
CATAAGGTCCGGCTGGCAGTGGGCAGCGGGCTGCGCCCAG 160 
ATACCTGGGAGCGTTTTGTGCGGCGCTTCGGGCCCCTGC A 200 

210 220 230 240 

, , t t \ t t t t 1 t r t t ! i ! i i ! t f \ i t t t i i i t t t r I i 1 l 1 1 

GGTGCTGGAGACATATGGACTGACAGAGGGCAACGTGGCC 240 
ACCATCAACTACACAGGACAGCGGGGCGCTGTGGGGCGTG 280 
CTTCCTGGCTTTACAAGCATATCTTCCCCTTCTCCTTGAT 320 
TCGCTATGATGTCACCACAGGAGAGCCAATTCGGGACCCC 360 
CAGGGGCACTGTATGGCCACATCTCCAGGTGAGCCAGGGC 400 

410 420 430 440 

i . . ■ i i ■ i i 1 t i • i i ' ' ■ ' I ' ■ ' ' i * ' ■ ' 1 ■ ' ■ ' I ■ ' ■ ' I 

TGCTGGTGGCCCCGGTAAGCCAGCAGTCCCCATTCCTGGG 440 
CTATGCTGGCGGGCCAGAGCTGGCCCAGGGGAAGTTGCTA 480 
AAGGATGTCTTCCGGCCTGGGGATGTTTTCTTCAACAC I G 520 
GGGACCTGCTGGTCTGCGATGACCAAGGTTTTCTCCGCTT 560 
CCATGATCGTACTGGAGACACCTTCAGGTGGAAGGGGGAG 600 

610 620 630 640 

f < ' t I i f > t I i i i t I i i . < I i i ■ i I i I r i I i l l l I I l 1 I I 

AATGTGGCCACAACCGAGGTGGCAGAGGTCTTCGAGGCCC 640 
TAGATTTTCTTCAGGAGGTGAACGTCTATGGAGTCACTGT 680 
GCCAGGGCATGAAGGCAGGGCTGGAATGGCAGCCCTAGTT 720 
CTGCGTCCCCCCCACGCTTTGGACCTTATGCAGCTCTACA 760 
CCCACGTGTCTGAGAACTTGCCACCTTATGCCCGGCCCCG 800 

810 820 830 340 

. ■ . ■ i i ■ ■ i I ■ ■ « ' ' ' » ' ■ 1 ■ ■ 1 ' 1 ■ ■ 1 1 1 1 1 1 1 1 ' ' ' ■ I 

ATTCCTCAGGCTCCAGGAGTCTTTGGCCACCACAGAGACC 840 
TTCAAACAGCAGAAAGTTCGGATGGCAAATGAGGGCTTCG 880 
ACCCCAGCACCCTGTCTGACCCACTGTACGTTCTGGACCA 920 
GGCTGTAGGTGCCTACCTGCCCCTCACAACTGCCCGGTAC 960 
AGCGCCCTCCTGGCAGGAAACCTTCGAATCTGAGAACTTC 1000 

1010 1020 1030 1040 

f i i i ! ■ ' * ' i t i * r f ' i t i I i » t i I > t t r I t i r t I t f ! t I 

CACACCTGAGGCACCTGAGAGAGGAACTCTGTGGGGTGGG 1040 
GGCCGTTGCAGGTGTACTGGGCTGTCAGGGATCTTTTCTA 1080 
TACCAGAACTGCGGTCACTATTTTGTAATAAATGTGGCTG 1 120 
GAGCTGATCCAGCTGTCTCTGACAAAAAAAAAAAAAAAAA 1 160 
.AAAGGGCGGCCGC 1173 



hsFATP3Dartial.protein 



10 20 30 40 

.t o t L / t t t I t t i i \ t i t i \ t 1 1 f i 1 1 i t I I i i i 1 i i < i 1 

KFSAGGFWEDCQQHRVTVFQY I GELCRYLVNQPPSKAERG 40 
HKVRLAVGSGLRPDTWERFVRRFGPLQVLETYGLTEGNVA 80 
T I NYTGQRGAVGRASWLYKHI FPFSL I RYDVTTGEP IROP 120 
QGHCMATSPGEPGLLVAPVSQQSPFLGYAGGPELAQGKLL 160 
KDVFRPGDVFFNTGDLLVCDDQGFLRFHDRTGDTFRWKGE 200 

210 220 230 240 

| ' ' ■ ■ i ' ■ ■ ' I ' ' ■ ■ ' 1 ■ ' ■ I * ' ■ ' i ' ■ ' ' I ' ' ' ' i 1 1 ' ■ 1 

; NVATTEVAEVFEALDFLGEVNVYGVTVPGHEGRAGMAALV 240 
' LRPPHALDLMQLYTHVSENLPPYARPRFLRLQESLATTET 280 
• FKQQKVRMANEGFDPSTLSDPLYVLDQAVGAYLPLTTARY 320 
SALLAGNLR I. 331 




hsFATP4 full length 



i 

10 20 30 40 

i i i t 1 » r i t I i i t t I i > i t I i t i i t i i r t f t t i i 1 t i i t 1 

CGACCCACGCGTCCGGGCGGGCGGGGCCGGGCGGCGGGCG 40 
GGGCTGGCGGGGCGGCCGGGCCATGCAGGGCGCAGAGCCG 80 
GCTAAACCCTGCTGAGACCCGGCTCCGTGCGTCCAGGGGC 120 
GGCTAATGCCCCTCACGCTGTCTACGCTGCTGCAACCGGG 160 
CCGCATCTGGACGGGGCGCCGCGCGGCGGAGCCGACGCCG 200 

210 220 230 240 

i t i ' i i ■ ' i I i i ■ ' i ' ' ■ i I ' ■ ' ■ ' ■ ' ■ ' 1 ■ ' ■ ' I ■ 1 ' ' I 

GGCCACAATGCTGCTTGGAGCCTCTCTGGTGGGGGTGCTG 240 
CTGTTCTCCAAGCTGGTGCTGAAACTGCCCTGGACCCAGG 280 
TGGGATTCTCCCTGTTGTTCCTCTACTTGGGATCTGGCGG 320 
CTGGCGCTTCATCCGGGTCTTCATCAAGACCATCAGGCGC 360 
GATATCTTTGGCGGCCTGGTCCTCCTGAAGGTGAAGGCAA 400 

410 420 430 440 

■ ■ i i I t . , t I I I I I I I r I I I i i i t I < I I I I I I I I I t I I I I 

AGGTGCGACAGTGCCTGCAGGAGCGGCGGACAGTGCCCAT 440 
TTTGTTTGCCTCTACCGTTCGGCGCCACCCCGACAAGACG 480 
GCCCTGATCTTCGAGGGCACAGATACCCACTGGACCTTCC 520 
GCCAGCTGGATGAGTACTCAAGCAGTGTAGCCAACTTCCT 560 
GCAGGCCCGGGGCCTGGCCTCGGGCGATGTGGCTGCCATC 600 

610 620 630 640 

f * t i I f r t t I t t r t I t t 1 . I i r t 1 I t i i r I r ( t t 1 1 i ! 1 I 

TTCATGGAGAACCGCAATGAGTTCGTGGGCCTATGGCTGG 640 
GCATGGCCAAGCTCGGTGTGGAGGCAGCCCTCATCAACAC 680 
CAACCTGCGGCGGGATGC TCTGCTCCACTGCCTCACCACC 720 
TCGCGCGCACGGGCCCTTGTCTTTGGCAGCGAAATGGCCT 760 
CAGCCATCTGTGAGGTCCATGCCAGCCTGGACCCCTCGC i 800 

8 10 820 830 840 

■ t r I I I 1 1 I I t I I I I I I I t 1 I I I I I I I T I I I ! T 1 ! I I I I I 

CAGCCTCTTCTGCTCTGGCTCCTGGGAGCCCGGTGCGGTG 840 
CCTCCAAGC ACAGAACACCTGGACCCTCTGCTGAAAGATG 880 
CTCCCAAGCACCTTCCCAGTTGCCCTGACAAGGGCTTCAC 920 
AGATAAACTGTTCTACATCTAC ACATCCGGC ACCACAGGG 960 
CTGCCCAAGGCCGCCATCGTGGTGCACAGCAGGTATTACC 1000 

1010 1020 1030 1040 

■ ■ ■ ■ ' ' ' ■ ■ I 1 ■ • ■ ' ' ■ ' ■ 1 ■ ' ' 1 ' ■ 1 ' ' I ' ■ i = i ' 1 1 ' ^ 

GCATGGCTGCCCTGGTGTACTATGGATTCCGCATGCGGCC 1040 
CAACGACATCGTCTATGACTGCCTCCCCCTCTACCACTCA 1080 
GCAGGAAACATCGTGGGAATCGGCCAGTGCCTGCTGCATG 1 120 
GCATGACGGTGGTGATTCGGAAGAAGTTCTCAGCCTCCCG 1 160 
GTTCTGGGACGATTGTATCAAGTACAACTGCACGATTGTG 1200 



fa. 50A 



hsFATP4 full length 



1210 1220 1230 1240 

, . . i i i i ■ i I i ' ■ i i i ' ■ * i ■ ■ ■ ' 1 ' ■ ■ ■ i 1 ■ ' 1 i 1 ' ' ' I — 

CAGTACATTGGTGAACTGTGCCGCTACCTCCTGAACCAGC 1240 
CACCGCGGGAGGCAGAAAACCAGCACCAGGTTCGCATGGC 1280 
ACTAGGCAATGGCCTCCGGCAGTCCATCTGGACCAACTTT 1320 
TCCAGCCGCTTCCACATACCCCAGGTGGCTGAGTTCTACG 1360 
GGGC'CACAGAGTGCAACTGT AGCCTGGGCAACTTCGACAG 1 400 

1410 1420 1430 1440 
. , , , i i ■ f t l i i i ■ i i i i ■ l i < < i i i t i 1. 1 i t i i l i i i i .,1 

CCAGGTGGGGGCCTGTGGTTTCAATAGCCGCATCCTGTCC 1440 
TTCGTGTACCCCATCCGGTTGGTACGTGTCAACGAGGACA 1480 
CCATGGAGCTGATCCGGGGGCCCGACGGCGTCTGCATTCC 1520 
CTGCCAGCCAGGTGAGCCGGGCCAGCTGGTGGGCCGCAiC 1560 
ATCCAGAAAGACCCCCTGCGCCGCTTCGATGGCTACCTCA 1600 

1610 1620 1630 1640 

, r « r I i ... I i ■ t i I i i i i I t 1 i i 1 i i i I I r . . i 1 i r i ; I 

ACCAGGGCGCCAACAACAAGAAGATTGCCAAGGATGTCTT 1640 
CAAGAAGGGGGACCAGGCCTACCTTACTGGTGATGTGCTG 1680 
GTGATGGACGAGCTGGGCTACCTGTACTTCCGAGACCGCA 1720 
CTGGGGACACGTTCCGCTGGAAAGGTGAGAACGTGTCCAC 1760 
CACCGAGGTGGAAGGCACACTCAGCCGCCTGCTGGACATG 1800 

1810 1820 1830 1840 

I ... I . ■ i < I .... I ■ i ■ i I ■ t r I I I I I T I 1 i t 1 i I I I 1 i 

GCTGACGTGGCCGTGTATGGTGTCGAGGTGCCAGGAACCG 1840 
AGGGCCGGGCCGGAATGGCTGCTGTGGCCAGCCCCACTGG 1880 
CAACTGTGACCTGGAGCGCTTTGCTCAGGTCTTGGAGAAG 1920 
GAACTGCCCCTGTATGCGCGCCCCATCTTCCTGCGCCTCC 1960 
TGCCTGAGCTGCACAAAACAGGAACCTACAAGTTCCAGAA 2000 

2010 2020 2030 2040 

, , , . i i . ■ i l i ■ ■ ■ i ■ ■ ■ ■ I ' ' * ■ 1 1 ■ 1 ■ I 1 1 ■ 1 1 1 ' ■ 1 I 

GACAGAGCTACGGAAGGAGGGCTTTGACCCGGCTATTGTG 2040 
AAAGACCCGCTGTTCTATCTAGATGCCCAGAAGGGCCGC ! 2080 
ACGTCCCGCTGGACCAAGAGGCCTACAGCCGCATCCAGGC 2120 
AGGCGAGGAGAAGCTGTGATTCCCCCC ATCCCTCTGAGGG 2160 
CCGGCGGATGCTGGATCCGGAGCCCCAGGTTCCGCCCCAG 2200 

2210 2220 2230 2240 

, ■ . i i . ■ ■ ■ i i ■ ■ ■ i ' i ■ ' I ■ 1 ' ■ ' ' 1 ' 1 I 1 1 ' ' i ' ' 1 1 1 

AGCGGTCCTGGACAAGGCCAGACCAAAGCAAGCAGGGCCT 2240 
GGCACCTCCATCCTGAGGTGCTGCCCCTCCATCCAAAACT 2280 
GCCAAGTGACTCATTGCCTTCCCAACCCTTCCAGAGGCTT 2320 
TCTGTGAAAGTCTCATGTCCAAGTTCCGTCTTCTGGGCTG 2360 
GGCAGGCCCTCTGGTTCCCAGGCTGAGACTGACGGGTTT I 2400 

2410 2420 2430 2440 

.... I ■ ... I t ... I i t i i I i t i i I i i i i I t i r i 1 i i i i. l 

CTCAGGATGATGTCTTGGGTGAGGGTAGGGAGAGGACAAG 2440 
GGGTCACCGAGCCCTTCCCAGAGAGCAGGGAGCTTATAAA 2480 
TGGAACCAGAGCAGAAGTCCCCAGACTCAGGAAGTCAACA 2520 
GAGTGGGCAGGGACAGTGGTAGCATCCATCTGGTGGCCAA 2560 
AGAGAATCGTAGCCCCAGAGCTGCCCAAGTTCACTGGGCT 2600 



hsFATP4 full length 



2610 2620 2630 2640 

, , . . I . ■ ■ ■ 1 i t , . 1 I . . ■ 1 I r . I 1 I I 1 I I t I I I 1 t I I t 1 

CCACCCCCACCTCCAGGAGGGGAGGAGAGGACCTGACATC 2640 
TGTAGGTGGCCCCTGATGCCCCATCTACAGCAGGAGGTCA 2680 
GGACCACGCCCCTGGCCTCTCCCCACTCCCCCATCCTCCT 2720 
CCCTGGGTGGCTGCCTGATTATCCCTCAGGCAGGGCCTCT 2760 
CAGTCCTTGTGGGTCTGTGTCACCTCCATCTCAGTCTTGG 2800 

2810 2820 2830 2840 

.... 1 I t . I I . r . . \ , , , . I l i . ■ I . 1 I 1 1 I ! 1 I I 1 I I I I 

CCTGGCTATGAGGGGAGGAGGAATGGGAGAGGGGGCTCAG 2840 
GGGCCAATAAACTCTGCCTTGAGTCCTCCTAAAAAAAAAA 2880 
AAAAAAAAAAAAAAAAAAAAAAAAAAA 2907 

fu. SOC 



) 



hsFATFM full length, protein 




MLLGASLVGVILFSKLVLKLPWTQVGFSLLFLYLGSGGWR 40 
F 1 RVF I KT I RRD IFGGLVLLKVKAKVRQCLQERRTVP ILF 80 
AS'TVRRHPDKTAL I FEGTDTHWTFRQLDEYSSSVANFLQA 120 
RGLASGOVAAIFMENRNEFVGLWLGMAKLGVEAAL I NTNL 160 
RRDALLHCLTTSRARALVFGSEMASA I CEVHASLDPSLSL 200 

210 220 230 240 



FCSGSWEPGAVPPSTEHLDPLLKDAPKHLPSCPDKGFTDK 240 
LFY I YTSGTTGLPKAAI VVHSRYYRMAALVYYGFRMRPND 280 
1 VYDCLPLYHSAGNI VGI GQCLLHGMTVV i RKKFSASRFW 320 
DOC I KYNCT I VQY I GELCRYLLNQPPREAENQHQVRMALG 360 
NGLRQSIWTNFSSRFH I PQVAEFYGATECNCSLGNFDSQV 400 

410 420 430 440 



GACGFNSRILSFVYP IRLVRVNEOTMELIRGPDGVC IPCQ 440 
PGEPGQLVGR I I QKDPLRRFDGYLNQGANNKK I AKDVFKK 480 
GDQAYLTGDVLVMDELGYLYFRDRTGDTFRWKGENVSTTE 520 
VEGTLSRLLDMADVAVYGVEVPGTEGRAGMAAVASPTGNC 560 
DLERFAQVLEKELPLYARP I FLRLLPELHKTGTYKFQKTE 600 

610 620 630 640 



LRKEGFDPAI VKDPLFYLDAQKGRYVPLDQEAYSR IQAGE 640 
EKL 643 





i. 1 t i i , i 1 . 



i i I i i i i I i i i i I i i i i I i i i i I i t i i I 




SI 



>hsFATP5(partial) 

GTCGTTGGGATCCTCGGCTGCTTAGATCTCGGAGCCACCTGTGTTCTGGCCCCCAAG 
TTCTCTACTTCCTGCTTCTGGGA 

TGACTGTCGGCAGCATGGCGTGACAGTGATCCTGTATGTGGGCGAGCTCCTGCGATA 
CTTGTGTAACATTCCCCAGCAAC 

CAGAGGACCGGACACATACAGTCCGCCTGGCAATGGGCAATGGACTACGGGCTGAT 
GTGTGGGGAGACCTTCCAGCAGCG 

TTTCGGTCCTATTTCGGATCTNGGGAAGTCTTACGGGCTTCCACAGAAGGGCAACAT 
: GGGGCTTT AGTTC AAATATTGTT 

GGGGGCGCTGCGGGGCCCTGGGGGCAAAGATGGAGCTTGCCTCCTCCGAATGCTGT 
CCCCCTTTGAGCTGGTGCAGTTCG 

ACATGGAGGCGGCGGAGCCTGTGAGGGACAATCAGGGCTTCTGCATCCCTGTAGGG 
CTAGGGGAGCCGGGGCTGCTGTTG 

ACCAAGGTGGTAAGCCAGCAACCCTTCGTGGGCTACCGCGGCCCCCGAGAGCTGTC 
GGAACGGAAGCTGGTGCGCAACGT 

GCGGCAATCGGGCGACGTTTACTACAACACCGGGGACGTACTGGCCATGGACCGCG 
AAGGCTTCCTCTACTTCCGCGACC 

GACTCGGGGACACCTTCCGATGGAAGGGCGAGAACGTGTCCACGCACGAGGTGGAG 
GGCGTGTTGTCGCAGGTGGACTTC 

TTGCAACAGGTTAACGTGTATGGCGTGTGCGTGCCAGGTTGTGAGGGTAAGGTGGGC 
ATGGCTGCTGTGGCATTAGCCCC 

CGGCCAGACTTTCGACGGGGAGAAGTTGTACCAGCACGTTCGCGCTTGGCTCCCTGC 
CTACGCTACCCCCCATTTCATCC 

GCATCCAGGACGCCATGGAGGTCACCAGCACGTTCAAACTGATGAAGACCCGGTTG 
GTGCGTGAGGGCTTCAATGTGGGG 

ATCGTGGTTGACCCTCTGTTTGTACTGGACAACCGGGCCCAGTCCTTCCGGCCCCTG 
ACGGCAGAAATGTACCAGGCTGT 

GTGTGAGGGAACCTGGAGGCTCTGATCACCTGGCCAACCCACTGGGGTAGGGATCA 
AAGCCAGCCACCCCCACCCCAACA 

CACTCGGTGTCCCTTTCATCCTGGGCCTGTGTGAATCCCAGCCTGGCCATACCCTCA 
ACCTCAGTGGGCTGGAAATGACA 

GTGGGCCCTGTAGCAGTGGCAGAATAAACTCAGMTGYGTTCACAGAAA 



hsFATP5p artial,protein 



1 1 1 ,i 




VVG I LGCLDLGATCVLAPKFSTSCFWDOCRQHGVTV ILYV 40 
GELLRYLCN I PQQPEDRTHTVRLAMGNGLRADVWGDtPAA 80 
FRSYFGSXEVLRASTEGQHGALVQ ILLGALRGPGGKDGAC 120 
LLRMLSPFELVQFDMEAAEPVRDNQGFC IPVGLGEPGLLL 160 
TKVVSQQPFVGYRGPRELSERKLVRNVRQSGDVYYNTGOV 200 

210 220 230 240 



LAMOREGFLYFRDRLGDTFRWKGENVSTHEVEGVLSQVDF 240 
LQQVNVYGVCVPGCEGKVGMAAVALAPGQTFDGEKLYQHV 280 
RAWLPAYATPHF IR IGDAMEVTSTFKLMKTRLVREGFNVG 320 
I VVDPLFVLDNRAGSFRPLTAEMYQAVCEGTWRL 354 





hsFATP6 full lenght.DNA 



10 20 30 40 

' ■ ■ ' i ' ' ' ■ I ' ' ■ ■ i < ' ' ' I * ' < ' i ■ ' ' ■ I ' ' ' * i ■ ■ i ■ I 

AACGGCAAGTAAGCGC AACGCAATTAATGTGAGTAGCTCA 40 
CTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGG 80 
CTCGTATGTTGTGTGGAATTGTGAGCGGATACCAATTTCA 120 
CACAGGAACCAGCTATGACATGATTACGAATTTAATACGA 160 
CTCACTATAGGGAATTTGGCCCTCGAGGCCAAGAATTCGG 200 

210 220 230 240 

' 1 1 ' ^ t f t f 1 T 1 I t I t ; t I 1 ! t T I 1 f t ! I i » T 1 r I 1 t I f I 

CACGAGGGGTGCTGAGCCCCTGCGCGGTTTCTGGTGCGTA 240 
GAGACTGTAAATCGCTGCGCTTCTCAGTC ATCATCATCCC 280 
AGCTTTTCCCGGCTCGAATTCAGCCTCCAACTCAAGCTCG 320 
CGGGAAAGACTACCTGAGAGGAGAAAAGCTTCTGTCCCTG 360 
GACCTTCTTCTGAGGGTGGAGTCGGAGGCTCCCTGCTTTC 400 

410 420 430 440 

' i t t I i i l i 1 i r i i I it ii I , i , i I , , , , I , i . , I , , , , I 

CAGCCGCCCAGTGACCCAAGCTTAATCTTCAGCACCACTT 440 
GGGGCGACCTTTTCGGTGCAAACCTACGATTCTGTTTCTC 480 
AGGATTCCTCCCCATCCCGCTTCGCCCCGGAAAAGCTGAC 520 
AAGAACTTCAGGTGTAAGCCCTGAGTAGTGAGGATCTGCG 560 
GTCTCCGTGGAGAGCTGTGCCTGGAAGAGAAGGACGCTGG 600 

610 620 630 640 

i i i i 1 i i i < I r i t i I i i i i I i i i i I i i ■ i I i i i i I ■ i i i I 

TGGGGGCTGAGATCAGAGCTGTCTTCTGGCCCAGTTGCCC 640 
CCATGCTTCTGTCATGGCTAACAGTTCTAGGGGCTGGAAT 680 
GGTCGTCCTGCACTTCTTGCAGAAACTCCTGTTCCCTTAC 720 
TTTTGGGATGACTTCTGGTTCGTGTTGAAGGTGGTGCTCA 760 
TTATAATTCGGCTGAAGAAGTATGAAAAGAGAGGGGAGCT 800 

810 820 830 840 

■ ' ■ ' I — ! 1 ? ! f 1 T T t f I T T I ! i r t « t i i i t I i i i i 1 « i i t I 

GGTGACTGTGCTGGATAAATTCTTGAGTCATGCCAAAAGA 840 
CAACCTCGGAAACCTTTCATCATCTATGAGGGAGACATCT 880 
ACACCTATCAGGATGTAGACAAAAGGAGCAGCAGAGTGGC 920 
CCATGTCTTCCTGAACCATTCCTCTCTGAAAAAGGGGGAC 960 
ACGGTGGCTCTGCTGATGAGCAATGAGCCGGACTTCGTTC 1000 

1010 1020 1030 1040 

' 1 ' ' I ■ r ■ ■ 1 1 1 * ' i ■ ■ 1 ' l ' ' 1 ' 1 ■ ' ■ 1 l ' ■ ' 1 1 ' ■ ' • I 
ACGTGTGGTTCGGCCTCGCCAAGCTGGGCTGCGTGGTGGC 1040 
CTTTCTCAACACCAACATTCGCTCCAACTCCCTCCTGAAT 1080 
TGCATCCGCGCCTGTGGGCCCAGAGCCCTAGTGGTGGGCG 1 120 
CAGATTTGCTTGGAACGGTAGAAGAAATCCTTCCAAGCCT 1 160 
CTCAGAAAATATCAGTGTTTGGGGGATGAAAGATTCTGTT 1200 



Fr 4 . SHA 



hsFATP6 full lenght.DNA 



1210 1220 1230 1240 

_ j i * r I i t » t ! * i i i 1 t t t « 1 t i t t I i t t t 1 I I i I I i t t t ! 

CCACAAGGTGTAATTTCACTCAAAGAAAAACTGAGCACCT 1240 
CACCTGATGAGCCCGTGCCACGCAGCCACCATGTTGTCTC 1280 
ACTCCTCAAGTCTACTTGTCTTTAC ATTTTTACCTCTGGA 1320 
ACAACAGGTCTACCAAAAGCAGCTGTGATTAGTCAGCTGC 1360 
AGGtTTTAAGGGGTTCTGCTGTCCTGTGGGCTTTTGGTTG 1400 

1410 1420 1430 1440 

• ■ ■ ■ i ■ * ' ' I ' ' ' ' i ' ' * ■ I ' ■ ' ' i ' ' ■ ' l ■ ■ ' ■ * ■ ' ' ■ I 
TACTGCTCATGACATTGTTTATATAACCCTTCCTCTGTAT 1440 
CATAGTTCAGCAGCTATCCTGGGAATTTCTGGATGTGTTG 1 480 
AGTTGGGTGCCACTTGTGTGTTAAAGAAGAAATTTTCAGC 1520 
AAGCCAGTTTTGGAGTGACTGCAAGAAGTATGATGTGACT 1560 
GTGTTTCAGTATATTGGAGAACTTTGTCGCTACCTTTGCA 1600 

1610 1620 1630 1640 

i 1 t i 1 i * r i I * t t t 1 i i t i [ I I T 1 1 T I f I I f 1 I I I I 1 1 I 1 

AACAATCTAAGAGAGAAGGAGAAAAGGATCATAAGGTGCG 1640 

TTTGGCAATTGGAAATGGCATACGGAGTGATGTATGGAGA 1680 

GAATTTTTAGACAGATTTGGAAATATAAAGGTGTGTGAAC 1720 

TTTATGCAGCTACCGAATCAAGCATATCTTTCATGAACTA 1760 

CACTGGGAGAATTGGAGCAATTGGGAGAACAAATTTGTTT 1800 

1810 1820 1830 1840 

t t t I f T I f r 1 I I I f T I I f t 1 1 t 1 1 I 1 1 t T I I ! I 1 1 I t t i 1 

TACAAACTTCTTTCCACTTTTGACTTAATAAAGTATGACT 1840 
TTCAGAAAGATGAACCCATGAGAAATGAGCAGGGTTGGTG 1880 
TATTCATGTGAAAAAAGGAGAACCTGGACTTCTCATTTCT 1920 
CGAGTGAATGCAAAAAATCCCTTC TTTGGCTATGCTGGGC 1960 
CTTATAAGCACACAAAAGACAAATTGCTTTGTGATGTTTT 2000 

2010 2020 2030 2040 

■ ■ < i i ' ' ■ ' I ■ ' ' ' i ' ■ ■ ' I > > ■ ■ i ■ ' ■ ■ I < ' ■ ' I 1 ' ' ' I 

TAAGAAGGGAGATGTTTACCTTAATACTGGAGACTTAATA 2040 
GTCCAGGATCAGGACAATTTCCTTTATTTTTGGGACCGTA 2080 
CTGGAGACACTTTCAGATGGAAAGGAGAAAATGTCGCAAC 2120 
CACTGAGGTTGCTGATGTTATTGGAATGTTGGATTTCATA 2160 
CAGGAAGCAAACGTCTATGGTGTGGCTATATCAGGTTATG 2200 

2210 2220 2230 2240 

■ ' ' ' I ■ ' ' ■ I ' ' ' ' ! ' ■ ' ' I ' ' ' ' I ' ' ' ' 1 ' ' ' ' I ' ' ' ' I 

AAGGAAGAGCAGGAATGGCTTCTATTATTTTAAAACCAAA 2240 
TACATCTTTAGATTTGGAAAAAGTTTATGAACAAGTTGTA 2280 
ACATTTCTACCAGCTTATGCTTGTCCACGATTTTTAAGAA 2320 
TTCAGGAAAAAATGGAAGCAACAGGAACATTC AAACTATT 2360 
GAAGCATCAGTTGGTGGAAGATGGATTTAATCCACTGAAA 2400 

2410 2420 2430 2440 

t 1 T T 1 1 I 1 ! 1 I T 1 I I 1 I T t 1 I T 1 t ! I 1 I 1 [ f T T t f I t 1 t 1 

ATTTCTGAACCACTTTACTTCATGGATAACTTGAAAAAGT 2440 
CTTATGTTCTACTGACCAGGGAACTTTATGATCAAATAAT 2480 
GTTAGGGGAAATAAAACTTTAAGATTTTTATATCTAGAAC 2520 
TTTCATATGCTTTCTTAGGAAGAGTGAGAGGGGGGTATAT 2560 
GATTCT TTATGAAATGGGGAAAGGGAGCTAACATTAATTA 2600 



hsFATP6 full lenghtDNA 



2610 2620 2630 2640 

» ' * » t tttil ttrtt irtil Titiltirtt f i i i 1 _ 

TGCATGTACTATATTTCCTTAATATGAGAGATAATTTTTT 2640 
AATTGCATAAGAATTTTAATTTCTTTTAATTGATATAAAC 2680 
ATTAGTTGATTATTCTTTTTATCTATTTGGAGATTCAGTG 2720 
CATAACTAAGTATTTTCCTTAATACTAAAGATTTTAAATA 2760 
ATAAATAGTGGCTAGCGGTTTGGACAATCACTAAAAATGT 2800 

2810 2820 2830 2840 

' ' ■ ■ 1 ■ 1 ■ 1 I 1 ■ ' 1 1 ' ' ■ ' I ■ ■ ' ' 1 ■ 1 ' ■ I ■ ' ■ ' i ■ ■ ■ ■ I 

ACTTTCTAATAAGTAAAATTTCTAATTTTGAATAAAAGAT 2840 
TAAATTTTACTGAAAAAAAAAAAAAAAAAAAAAATTGGCG 2880 
GCCGC 2885 



Ft*- 



hsFATP6 full lenght.protein 



f 



10 20 30 40 

.... 1 . . ■ ■ I t 1 I ■ I 1 I I ! I 1 1 1 1 I I I I I 1 I I I I I I I ' I I 

MLLSWLTVLGAGMVVLHFLQKLLFPYFWDDFWFVLKVVLI 40 
I IRLKKYEKRGELVTVLDKFLSHAKRQPRKPF 1 1 YEGD I Y 80 
T.YQDVDKRSSRVAHVFLNHSSLKKGDTVALLMSNEPDFVH 120 
VWFGLAKLGCVVAFLNTN IRSNSLLNC I RACGPRALVVGA 160 
DLLGTVEEI LPSLSEN I SVWGMKDSVPQGV I SLKEKLSTS 200 
210 220 230 240 

'_ } i , i 1 r . . r t t i . i I i t t . ] t i < i I t f 1 I 1 1 I 1 I 1 T t t T 



PDEPVPRSHHVVSLLKSTCLY I FTSGTTGLPKAAV I SQLQ 240 
VLRGSAVLWAFGCTAHD I VY I TLPLYHSSAA I LG I SGC VE 280 
LGATCVLKKKFSASQFWSDCKKYDVTVFGY I GELCRYLCK 320 
QSKREGEKDHKVRLAI GNG I RSDVWREFLDRFGNIKVCEL 360 
YAATESS ISFMNYTGR IGAI GRTNLFYKLLSTFDLIKYDF 4'00 

410 420 430 440 

■ ■ ■ . t . ■ ■ ■ I > ■ ■ ■ ' ■ ■ ' ■ I ' ' ' ' I ■ ' ' ' I ' ' ' ' T ' ' ' 1 1 



QKDEPMRNEQGWC I HVKKGEPGLL I SRVNAKNPFFGYAGP 440 
YKHTKDKLLCDVFKKGDVYLNTGDLI VQDQDNFLYFWDRT 480 
GDTFRWKGENVATTEVADV I GMLDF I GEANVYGVA ISGYE 520 
GRAGMASI ILKPNTSLDLEKVYEQVVTFLPAYACPRFLRI 560 
QEKMEATGTFKLLKHQLVEDGFNPLK ISEPLYFMDNLKKS 600 

610 620 630 640 

■ ■ . ■ I . , . ■ 1 ■ i . . I ■ ■ i . I ■ < ■ H I I I I I t I 1 t I 1 I t I ) 



YVLLTRELYDQIMLGEIKL. 620 



mFATPl full length .DN A 



10 20 30 40 
■ i i i i ii'il i i i t i < ■ i i l i i i i i i i i i l i i i i i i ■ i t I 

AAGTTCCCACTCCAGACTTCTGCGAGAACCCGTGAGGAAG 40 
CAGCGAGAACCGGGGGTTTGC AAGCCAGAGAAGGATGCGG 80 
ACTCCGGGAGCAGGAACAGCCTCTGTGGCCTCATTGGGGC 120 
TGCTTTGGCTTCTGGGACTTCCGTGGACCTGGAGCGCGGC 160 
GGCGGCGTTCGGTGTGTACGTGGGTAGCGGTGGCTGGCGA 200 

210 220 230 240 

i » r r T t i i t I i t i t t i l i i I 1 t r t I l i 1 T I I I I I 1 I I ! I 1 

TTTCTGCGTATCGTCTGCAAGACGGCGAGGCGAGACCTCT 240 
TTGGCCTCTCTGTTCTGATCCGCGTGCGGCTAGAGCTACG 280 
ACGACACCGGCGAGCAGGAGACACGATCCCACGCATCTTC 320 
CAGGCCGTGGCCCAGCGACAGCCGGAGCGCCTGGCGCTGG 360 
TAGATGCGAGTAGCGGTATCTGCTGGACCTTCGCACAGCT 400 

410 420 430 440 

I 1 t I ITIll I I I T ? I t 1 I I lltll tlllf lltfl tftll 

AGACACCTACTCCAATGCTGTGGCCAATCTGTTCCTCCAG 440 
CTGGGCTTTGCGCCAGGCGATGTGGTGGCTGTGTTCCTGG 480 
AAGGCCGGCCCGAGTTCGTGGGACTGTGGCTGGGCCTGGC 520 
CAAGGCCGGTGTAGTGGCTGCGCTTCTCAATGTCAACCTG 560 
AGGCGGGAGCCCCTTGCCTTCTGCTTGGGCACATCAGCTG 600 

610 620 630 640 

i i t t \ t i 1 i t i f i r I r t t i f r ! t r T i t t t 1 [ f 1 f f 1 1 T T ! 

CCAAGGCCCTCATTTATGGCGGGGAGATGGCAGCGGCGGT 640 
GGCGGAGGTGAGTGAGCAGCTGGGGAAGAGCCTGCTCAAG 630 
TTCTGCTCTGGAGATCTGGGGCCTGAGAGCGTCCTGCCTG 720 
ACACGCAGCTTCTGGACCCC ATGCTTGCTGAGGCGCCCAC 760 
CACACCCCTGGCACAGGCCCCAGGCAAGGGCATGGATGAT 800 

810 820 830 840 

* t * r 1 r i i i [ i i i i I i t i i t i t t i I f ! t l I 1 I I 1 I 1 1 f I I 

CGGCTATTTTACATCTATACTTCTGGGACCACCGGACTTC 840 
CTAAGGCGGCCATTGTGGTGCACAGCAGGTACTACCGCAT 880 
CGCAGCCTTCGGCCACCATTCCTACAGCATGCGGGCCAAC 920 
GATGTGCTCTATGACTGCCTACCTCTCTACC AC TCAGCAG 960 
GGAACATCATGGGCGTGGGACAGTGTATCATCTACGGGTT 1000 

1010 1020 1030 1040 

i ' * t t t t t t I r i f f ! i i i t 1 r t t t I r t t t 1 f f t t I t i T f f 

AACGGTGGTACTGCGCAAGAAGTTCTCCGCCAGCCGCTTC 1040 
TGGGACGACTGTGTCAAATATAATTGCACGGTAGTGCAGT 1080 
ACATCGGTGAAATATGCCGCTACCTGCTAAGGCAGCCGGT 1 120 
TCGCGATGTAGAGCGGCGGCACCGCGTGCGCCTGGCCGTG 1 160 
GGTAACGGACTGCGGCCAGCCATCTGGGAGGAGTTCACGC 1200 



rnFATPI full length .DN A 



1210 1220 1230 1240 

■ • ■ ■ t ■ ■ ■ ■ I ■ ' ■ ■ i ■ ■ » ' i ■ ' ' ■ 1 ' * ' ■ I ' ' ' 1 1 ' ' ' 1 1 

AGGGTTTCGGTGTGCGACAGATTGGCGAGTTCTACGGCGC 1240 
CACCGAATGCAACTGCAGCATTGCCAACATGGACGGCAAG 1280 
GTCGGCTCCTGCGGCTTCAACAGCCGTATCCTCACGCATG 1320 
TGTXCCCCATCCGTCTGGTCAAGGTCAACGAGGACACGAT 1360 
GGAGCCACTGAGGGACTCCCAAGGCCTCTGCATCCCGTGC 1400 

1410 1420 1430 1440 

, . i i 1 ■ i i t I i t i t I » i t i I i i r t I i i t i I i r i t I f r i I I 

CAGCCCGGGGAACCTGGGCTTCTCGTGGGCCAGATCAACC 1440 
AGCAAGACCCTCTGCGGCGCTTCGATGGCTATGTTAGTGA 1480 
CAGCGCCACCAACAAGAAGATTGCCCACAGCGTGTTCCGA 1520 
AAGGGGGACAGCGCCTACCTTTCAGGTGACGTGCTAGTGA 1560 
TGGACGAGCTGGGGTACATGTACTTCCGTGACCGCAGCGG 1600 

1610 1620 1630 1640 

i ■ i ■ i ■ ■ ' ■ I i ' * ' i ' ' » ■ I ' ■ ■ ' i ' ' ' ■ 1 ■ ' ■ ' 1 ' ' ' ' I 

GGATACCTTCCGATGGCGCGGCGAGAACGTATCCACCACG 1640 
GAGGTGGAAGCCGTGCTGAGCCGCCTGTTGGGCCAGACGG 1680 
ACGTGGCTGTGTATGGAGTGGCTGTGCCAGGAGTGGAGGG 1720 
GAAAAGCGGCATGGCGGCCATTGCAGACCCCCACAACCAG 1760 
CTGGACCCTAACTCAATGTACCAGGAATTGCAGAAGGTTC 1800 

1810 1820 1830 1840 

f'f^T tT^fT tfftl tlffl tlftl fTTti I T f t I T I T T [ . 

TTGCATCCTATGCCCAGCCCATCTTCCTGCGTCTTCTGCC 1840 
CCAAGTGGATACAACAGGCACCTTCAAGATCCAGAAGACC 1880 
CGACTACAGCGTGAAGGCTTTGACCCCCGCCAGACCTCAG 1920 
ACCGGCTCTTCTTTCTAGACCTGAAACAGGGACGCTACCT 1960 
ACCCCTGGATGAGAGAGTCCATGCCCGCATCTGCGCAGGC 2000 

2010 2020 2030 2040 

r t t i I i t i i I t t i t f i t i i i f t i i t i t i t I t r t t F i i i i ! 

GACTTCTCACTCTGAGCCTGGTGAGTGGGATGGCCCTGGA 2040 
CTTGTGAGACCAGGGAGCCGGACACCCCTGTTCAGGTGT I 2080 
TCTCCTGCCTGGCCACGTGGCCAGCAGCACCTGTGGGTGC 2120 
AGGAAACTGGAACCTGAGTGGCCGGGTGTCCCTTTCCTAC 2160 
AACCCACCATGCACACATCTAGCCTCTGCCTTGGTCTTT I 2200 

2210 2220 2230 2240 

iirtl riiff ii)t! ittif )trt! i f t f 1 itTfltfTl! 

TCTCCATCTCTTTCCTCCGTGCCCAGCAGGAGCCCCACAG 2240 
ACACATTGGCTGCTGTGTCCTGCAGTGGGACCGGTGTCTA 2280 
GGGGTCCATGCTGCAGGCTGTGACCCGCACTGGTGCCCAC 2320 
CTCCCTTCCCCATTGTGCCTTAGGTTCCTCCACTGTGCGC 2360 
CGGTGAAGCAAGTGGGGACCCACATAGCTGTTGTCCCTGC 2400 

2410 2420 2430 2440 
« ■ ' t i . f . i I ■ i > i i i i > ■ l i t i i i i ■ ■ i I i t i i i i i i i 1 — . 

TGAGGGTTGGTAGCAAATGCACCCTCATGTCAGCTGGGAG 2440 
ACACATGCAGTCTCCCACTGACCCCCAATCAACTGAAGAT 2480 
ACTGTTTTGTATTATTGTTTTGAGATAGGGTCTCACTGTG 2520 
GAGGCCAAGCTGGCCTCAGGCTCACCACTCTACTGCCTCC 2560 
GGGCACCAGCCTGCAGTTTGATGACATGTATGCACTATTG 2600 



rnFATPI full length.DNA 



2610 2620 2630 2640 

i i i t i i i i i I i i i i I » i i i i t t r t 1 r i i r 1 t t i t 1 t i i i T 

TTCTAAGGGTCTTCTGAGTCCCTGCTTTCCCCTCATGTCC 2640 
TAAAACCTTCCAGAACTGACTCTGATCACTTGGATGTAGC 2680 
TAGTGTTGGCCCTGCCCACGTGTGTCAATTCAGGGGTCCC 2720 
CAGGCATCATCTCTGGAGGCCCTAACCTTGGCAAAGCTTG 2760 
GATGTCCTCACATCACAGCAGGAGACCCAGGAAGGTTGCT 2800 
2810 2820 2830 2840 

t T * » 1 * t t * t 1 1 I 1 f I 1 t I I « t 1 1 I I I t t I I t 1 l I t I I !■ I 

GTGGTGTCTCTTGGGCACCCCTGGCGGCAGCCGTGGACAT 2840 
GCTTCCCTGCTGTGATAGCCCAAACTGTTGCCTATGACAT 2880 
TTGAGGTCTACCCTTCTGGCTGCCATGGTCCCCATTGAGA 2920 
TCTTTGGTGACTCACCTCAGCCACCAAGCCAGGCCTCTGC 2960 
CTTCCTTCAGCTCTAAGGGCATGAAGGGTGTGGACAGAGC 3000 

3010 3020 3030 3040 

■ ■ » ■ i ■ ' ■ ' I ■ * ' ■ i ■ ■ ' ■ t * ' ■ ■ i ' ' ■ ■ I ' ■ ' ' i ■ • < • * 

AGCCACAGGCTGCCCACAGTCACCCACATGCAAGTGTTAT 3040 
TTCCTTGTTTGTTTTAAAAAAATAAACATGCTGAGCCTTG 3080 
AAAAAAAAAAAAAAAAAA 3098 



rnFATPI full lenght.protein 





MRTPGAGTASVASLGLLWLLGLPWTWSAAAAFGVYVGSGG 40 
WRFLR I VCKTARRDLFGLSVLI RVRLELRRHRRAGDT I PR 80 
I FQAVAQRQPERLALVDASSG I CWTFAQLDTYSNAVANLF 120 
. LQLGFAPGDVVAVFLEGRPEFVGLWLGLAKAGVVAALLNV 160 
■ NILRREPLAFCLGTSAAKALI YGGEMAAAVAEVSEQLGKSL 200 



LKFCSGDLGPESVLPDTGLLDPMLAEAPTTPLAQAPGKGM 240 
DDRLFYI YTSGTTGLPKAA! VVHSRYYR I AAFGHHSYSMR 280 
ANDVLYDCLPLYHSAGN IMGVGQC I I YGLTV VLRKKFSAS 320 
RFWDDCVKYNCTVVQY I GE I CRYLLRQPVRDVERRHRVRL 360 
AVGNGLRPA IWEEFTQGFGVRQIGEFYGATECNCSIANMD 400 

410 420 430 440 



GKVGSCGFNSR ILTHVYP I RLVKVNEDTMEPLRDSQGLC I 440 
PCQPGEPGLLVGQ I NQQDPLRRFDGYVSDSATNKK I AHSV 480 
FRKGOSAYLSGDVLVMDELGYMYFRDRSGDTFRWRGENVS 520 
TTEVEAVLSRLLGQTDVAVYGVAVPGVEGKSGMAAI ADPH 560 
NQLDPNSMYQELQKVLASYAQP I FLRLLPQVDTTGTFK I Q 600 

610 620 630 640 



; KTRLQREGFDPRQTSDRLFFLOLKQGRYLPLDERVHARIC 640 
| AGDFSL. 647 







t 1 t 1 I 



fca. 57 



rnVLACS(FATF2)fuH length, DNA 



/ 

; 

10 20 30 40 

? * t t 1 riitl triij rtttl rtttl t r t t ! ttttt tttil 

GACACAGTACTGCCGATGTTGQACAGAGGATCGCTTAACA 40 
GAACGAAATCTCAAAACAAATTAACAGGACCCGGTTGCTT 80 
GATTTCCCAAATCAGAAAAGGCTCGAAATGTCTAGAGGGG 120 
CTGACTGATGCAGCGGTGACCCGGACTGGAGACAGTTGGA 160 
CGCGATCATCTCTGGTGCTTTTGTTCAACCTTGAAACCTT 200 

210 220 230 240 

' ■ ' ' ' ' ' ■ » I ■ < ■ ' t ' ' ' ' 1 ' ■ ' ■ i ' ' ■ ' I ' ' ■ ' i ■ ' > ■ i 

CGCCACAGGAGACTTGCCTGAGCAGAGAAGCAAACGTGGA 240 
GAAACAAAGAGAGATCTAGCGAAAAGCCTCTGGGACCAAG 280 
GAGGGGAGGTGGGACTCTGGGTTGGCGGTGGCACCTGCTG 320 
CCGGCTATTAATAATAGGGTCGCGATGCGTTTATAAGGTG 360 
TTTGATTAAACAAAGACTCTATGAGAGAAGAATAACTAGC 400 

410 420 430 440 

' > r i f i t t f 1 i > i t I i t t r ! i t t i I t r r t 1 f f f i 1 r I r i t 

AACAGCCCCACGTCTGAGTCGTCGCCTCCGACCTTTTTCA 440 
ACGTGGGTTCTTTGGGCCGAGCGTCGTTTGCCGAGAACTA 480 
GATCTCACCTGACCCCAGACGCTGAAAACAAGCGCTGTGG 520 
CATCCTGGGCCACCCAAGCTGACAAGGGCGCGCCCCCTGA 560 
GCACACGAGGTGCCCCACGAGGGGGAGGGACCCACAGCCG 600 

610 620 630 640 

■ ' ■ ' 1 ■ 1 ' 1 I ' 1 ■ ■ I ' ' ' ' I 1 ■ 1 ' 1 1 ' * 1 I ■ ' 1 ' i ■ ■ 1 1 I 

TCCCGCCCGCACCGCGGTGTCCGCTGCGGGCACCTGCAGC 640 
CGAGCCGCC ACCCGCAGTCGCAGCGCGTCCGGCGGCCGAA 680 
CCCGGTCGTCAGCTCGTCAGCACCTGCTCTGCTTCTCTCC 720 
CGCCCGCCGCCGCGCTGCACGCCTCGAGCGCTCCCTCGGC 760 
CCCGGCGGGGACCGGGGACCCCGCAGCCACCGCCATGCTG 800 

810 820 830 840 

' t 1 t t T t I I 1 i T I t \ 1 1 T 1 1 1 1 1 I t T > T t I I T t I f 1 t t I I 

CCTGTGCTCTACACCGGCCTGGCGGGGCTGC TGCTGCTGC 840 
CTCTGCTGCTCACCTGCTGCTGCCCCTACCTCCTCCAGGA 880 
CGTGCGGTTCTTCCTGCAACTGGCCAACATGGCCCGGCAG 920 
GTGCGCAGCTACCGGC AGCGGCGACCCGTGCGCACCATCC 960 
TGCATGTCTTCTTGGAGCAAGCGCGCAAGACCCCGCACAA 1000 

1010 1020 1030 1040 

* f T ' 1 t ( t l I t r . . [ , t t i I , t t 1 1 1 ) t i 1 1 1 t i 1 f 1 T i j 

GCCCTTCCTGCTGTTTCGCGACGAGACGCTTACCTACGCC 1040 

CAGGTAGACCGGCGCAGCAACCAAGTAGCGCGAGCGCTGC 1080 

ATGATCACCTGGGCCTGCGGCAGGGGGATTGCGTGGCCCT 1 120 

CTTCATGGGCAATGAGCCGGCCTACGTGTGGCTCTGGCTG 1 160 

GGACTGCTCAAACTGGGCTGTCCCATGGCGTGCCTCAACT 1200 



mVLACS(FATP2)fuH ienqth.PNA 



1210 1220 1230 1240 

f 1 ' 1 I ■ ■ i I I t < t i ! i i i i ! i i I t I r I i i I i I i i I I I i i I 

ACAACATCCGTGCCAAGTCTCTGCTACACTGCTTTCAGTG 1240 

CTGCGGGGCGAAGGTGCTGCTGGCCTCCCCAGAGCTACAC 1280 

GAAGCTGTCGAGGAGGTTCTTCCAACCCTGAAAAAGGAGG 1320 

GCGTGTCCGTCTTCTACGTAAGCAGAACTTCTAACACTAA 1360 

TGGCGTGGACACAGTACTGGACAAAGTAGACGGGGTGTCG 1400 

1410 1420 1430 1440 

' ' ' ' I t i i I I I > I i t I I I t I I t I I I I t I r I , I , I I I I I I I 

GCGGACCCCATCCCGGAGTCGTGGAGGTCTGAAGTCACGT 1440 

TCACCACACCCGCAGTCTACATATATACTTCGGGCACCAC 1480 

AGGTCTTCCAAAGGCTGCAACCATTAATCACCATCGCCTC 1520 

TGGTATGGGACCAGCCTTGCCCTGAGGTCCGGAATTAAGG 1560 

CTCATGACGTCATCTACACCACCATGCCCCTGTACCACAG 1600 

1610 1620 1630 1640 

.1 t I t t T 1 I f I 1 > T 1 t t T t t I I t I f f I I I f I I 1 T T I f f t I I 

CGCGGCGCTCATGATTGGCCTCCACGGATGCATTGTGGTT 1640 
GGGGCTACATTTGCTTTGCGGAGCAAATTTTCAGCCAGCC 1680 
AGTTTTGGGACGACTGCAGGAAATACAACGCCACTGTCAT 1720 
TCAGTACATCGGTGAACTGCTTCGGTACCTCTGCAACACG 1760 
CCCCAGAAACCAAATGACCGGGACCACAAAGTGAAAATAG 1800 

1810 1820 1830 1840 

..1 1 ! t 1 ! 1 1 T I f T T I I T f 1 I 1 T t T T 1 1 1 T 1 f T \ 1 1 I 1 1 t ! I 

CACTAGGAAATGGCTTACGAGGAGATGTGTGGAGAGAGTT 1840 
CATCAAGAGATTTGG'GGACATTCACATTTATGAGTTCTAC 1880 
GCTTCCACTGAAGGCAACATTGGATTTATGAACTATCCAA 1920 
GAAAAATCGGAGCTGTTGGAAGAGAAAATTACCTACAAAA 1960 
AAAAGTTGTAAGGCACGAGCTGATCAAGTATGACGTGGAG 2000 

2010 2020 2030 2040 

.* i i i I t r i ^ t t i i t t i t i r ! t * i i ! t i i t 1 i i i * I » i i i f 

AAGGATGAGCCTGTCCGTGATGCAAATGGATATTGCATCA 2040 
AAGTCCCCAAAGGAGAGGTTGGACTCTTGATTTGCAAAAT 2080 
CACAGAGCTCACACCATTTTTTGGCTATGCTGGAGGAAAG 2120 
ACCCAGACAGAGAAGAAAAAGCTCAGAGATGTTTTTAAGA 2160 
AAGGAGACGTCTACTTCAACAGTGGCGATCTCCTGATGAT 2200 

2210 2220 2230 2240 

,1 1 t T 1 \ , 1 ? 1 1 1 1 1 I I T 1 ! I 1 1 T 1 I I t F T f 1 1 1 1 t I T T 1 1 I 

CGACCGTGAAAATTTCATCTATTTTCACGACAGAGTTGGA 2240 
GACACCTTCCGGTGGAAAGGAGAGAATGTAGCTACCACGG 2280 
AAGTCGCTGACATTGTGGGACTGGTAGATTTTGTTGAAGA 2320 
AGTGAATGTTTACGGTGTGCCCGTGCCAGGTCATGAAGGT 2360 
CGCATCGGGATGGCCTCGATCAAGATGAAAGAAAACTACG 2400 

2410 2420 2430 2440 

* Tt.I t__t T > I t 1 t 1 I * j < I > T j t t I 1 I 1 > I t 1 I I I I 1 

AGTTCAATGGAAAGAAACTCTTTCAGCACATCTCGGAGTA 2440 
CCTGCCCAGTTACTCGAGGCCTCGGTTCCTGAGAATACAA 2480 
. GATACCATTGAGATC ACCGGGACTTTTAAACACCGCAAAG 2520 
TGACCCTGATGGAAGAGGGCTTTAACCCCTCAGTCATCAA 2560 
AGATACCTTGTATTTCATGGATGACAC AGAAAAAACATAC 2600 



Fa. 52% 



mVl_ACS(FATP2)full length.DNA 




2640 



GTGCCCATGACTGAGGACATTTATAATGCCATAATTGATA 2640 
AGACTCTGAAGCTCTGAATGTTGCCTGGCTCCTAACACTT 2680 
CCAGAAAGAAACACAATAGGCCTAGCATAGCCCCTTCACA 2720 
■TGTGTAATCCAACTTTAACTTGATTAAAGGTTATAGGTGT 2760 
GATfTTTCCTAGGAAATTATTCATTTAAAGGACAATTGTT 2800 

2810 2820 2830 2840 



TGTTTGTTTGTTTGTTTTTTATTAATTACACCAGAACGTT 2840 
TGCAAGTAAAAAGATTTAAAGTCACT TATTTTTCAATGTG 2880 
CACCTGCCATTTGTCCTTGCAAACTTAGCTTCTTGGAGAG 2920 
AGGGCCTTATTTTTTTAAAGACATAATAAACTATGTAAAC 2960 
ACT 2963 



I > ' ■ ' i ■ ■ ' * l ' ■ * ' ' ' ■ ' 



tilt 




rnVLACS(FATP2)fuI! length.prot 



10 20 30 40 

* * * * ^ tlltfllllT Ttllitttt^ltlll Till! fttll 

MLPVLYTGLAGLLLLPLLLTCCCPYLLQDVRFFLQLANMA 40 
RQVRSYRQRRPVRT I LHVFLEQARKTPHKPFLLFRDETLT 80 
YAQVDRRSNQVARALHOHLGLRQGDCVALFMGNEPAYVV/l 120 
WLGLLKLGCPMACLNYN I RAKSLLHCFQCCGAKVLLASPE 160 
LHEAVEEVLPTLKKEGVSVFYVSRTSNTNGVDTVLDKVDG 200 

210 220 230 240 

' ' ' • i ' 1 1 ■ I ' ' ■ ■ I ' 1 ■ ■ I ' ' ' ' 1 ■ ■ ' ■ I * ■ ' ■ i » ' ' ' I 

VSADP I PESWRSEVTFTTPAVYI YTSGTTGLPKAATINHH 240 
RLWYGTSLALRSG I KAHDV I YTTMPLYHSAALli I GLHGC I 280 
VVGATFALRSKFSASQFWDDCRKYNATV IQY I GELLRYLC 320 
NTPQKPNDRDHKVK I ALGNGLRGDVWREF I KRFGD I H I YE 360 
FYASTEGN IGFMNYPRK I GAVGRENYLQKKVVRHEL I KYD 400 

410 420 430 440 

' ■ 1 ■ 1 ' ' ■ » * ■ * < < * ■ ■ » 1 I < ■ * ■ 1 ■ ' ■ ■ I ■ < ■ ■ i ' ' ■ ■ I 

VEKDEPVRDANGYC I K VPKGEVGLL I CK I TELTPFFGYAG 440 
GKTQTEKKKLRDVFKKGDVYFNSGDLLM IDRENF i YFHDR 480 
VGDTFRWKGENVATTEVAD I VGLVDFVEEVNVYG VP VPGH 520 
EGR I GMAS I KMKENYEFNGKKLFQH I SEYLPSYSRPRFLR 560 
I QOT I E I TGTFKHRKVTLMEEGFNPSV IKDTLYFMDDTEK 600 

610 620 630 640 

i i t i T i i i i I i i f i I i i i i i i i f i I i r i i 1 t i i i I i > i t I 

TYVPMTED I YNAI IDKTLKL. 621 



rnFATP4 partial.DNA 



/ 10 20 30 40 

; i i i I ' i ■ i I i ■ ■ ■ 1 ■ ' ' ' I ' ' ' 1 I 1 ' ' ' I ' ' ' ' I ' 1 ' ' I 

GATCAGCTCTTCTATATCTAC ACGTCGGGCACCACGGGGC 40 
TACCCAAAGCTGCCATTGTGGTGCACAGCAGGTAT l ACCG 80 
AATGGCTGCCCTGGTGTACTATGGATTCCGCATGCGGCCT 120 
GATGACATTGTCTATGACTGCCTCCCCCTCTACCACTCAG 160 
CAGGAAACATTGTGGGGATTGGCCAGTGCGTACTCCACGG 200 

210 220 230 240 

i ■ i . i < • • • I i ■ ' ■ ' ' ' ■ ' 1 ' ' ■ 1 i 1 > ' ■ I ' ' 1 ' I 1 ' 1 1 I 

CATGACTGTGGTGATCCGGAAGAAGTTTTCAGCCTCCCGG 240 
TTCTGGGATGACTGTATCAAGTACAACTGCACAATTGTAC 280 
AGTACATTGGTGAGCTTTGCCGCTACCTCCTGAACCAGCC 320 
ACCCCGTGAGGCTGAGTCTCGGCACAAGGTGCGCATGGCA 360 
CTGGGCAACGGTCTCCGGCAGTCCATCTGGACCGACTTC I 400 

410 420 430 440 

i > , I ) 1 t t 1 ! f t T t 1 t 7 1 I I > t 1 I I 1 1 T f 1 T T T T 1 I T T I 1 

CCAGCCGTTTCCACATTCCCAAGGTGGCCGAGTTCTACGG 440 
GGCCACCGAGTGCAACTGTAGCTTGGGCAACTTTGACAGC 480 
CAGGTGGGGGCCTGTGGCTTCAATAGCCGCATCCTGTCC I 520 
TTGTGTACCCCATCCGCTTGGTACGAGTCAATGAGGATAC 560 
CATGGAACTGATCCGGGGACCCGATGGCGTCTGCATTCCC 600 

610 620 630 '640 

. i i . i i ■ ■ i i i ■ ■ i i ■ ' ■ ' I i ■ ■ ■ i ' ' ■ ' l ' ■ ' ' i ' ' ■ ' I 

TGTCAACCAGGCCAGCCAGGCCAGCTGGTGGGTCGCATCA 640 
TCCAGCAGGACCCCCTACGCCGTTTTGATGGCTACCTCAA 680 
CCAGGGTGCCAACAACAAGAAGATTGCTAGTGATGTCTTC 720 
AAGAAAGGGGACCAAGCCTACCTCACTGGTGACGTGCTGG 760 
TGATGGATGAGCTGGGCTACCTGTACTTCCGAGACCGCAC 800 

810 820 830 840 

, i t t I t i 1 t 1 t t t T t . f t t I r T i f I .till t t t T t t 1 1 '. I 

AGGGGACACGTTCCGCTGGAAAGGGGAGAATGTGTCTACC 840 
ACTGAAGTGGAGGGCACACTC AGCCGCCTGCTTC AGATGG 880 
CAGATGTGGCTGTTTATGGTGTTGAGGTGCCAGGAGCTGA 920 
GGGCCGAGCAGGAATGGCTGCTGTGGCAAGCCCCACTAGC 960 
AACTGTGACCTGGAGAGCTTTGC ACAGACCTTGAAAAAGG 1000 

1010 1020 1030 1040 

■ » ' » ' .... 1 ■ , . . I .... 1 , t . i T t < . i 1 t t i i f i i t t I 

AGCTGCCCCTGTACGCCCGCCCCATCTTCCTCCGCTTCTT 1040 

GCCTGAGCTGCACAAAACAGGAACCTTCAAGTTCCAGAAG 1080 

ACAGAGTTGCGGAAGGAGGGCTTTGACCCGTCTGTTGTGA 1 120 

AAGACCCACTCTTCTATTTGGATGCCCGGACAGGCTGCTA 1 160 

TGTTGCACTGGACCAAGAGGCCTATACCCGCATCCAGGCA 1200 



rnFATP4 partial.DNA 



1210 1220 1230 1240 

j t i t I t * t » f t i j,.,; | t t t t 1 i i t i I r ! i i t i i t f ? , i t t i j 

GGCGAGGAGAAGCTGTGATTTCCCCCACATCCCTCTGAGG 1240 
GCCAGAGGATGCTGGATTCAGAGCCCCAGCTTCCACTCCA 1280 
GAAGGGGTCTGGGCAAGGCCAGACCAAAGCTAGCAGGGCC 1320 
CGCACCTTCACCCTAGGTGCTGATCCCCCT 1350 



fa. 60S 



rnFATP4partia!.DNA 




DQLFY I YTSGTTGLPKAA I VVHSRYYRMAALVYYGFRMRP 40 
DD I VYDCLPLYHSAGN I VG I GQCVLHGMTV V I RKKFSASR 80 
FWDDC IKYNCTI VQY IGELCRYLLNQPPREAESRHKVRMA 120 
LGNGLRQSIWTDFSSRFH I PKVAEFYGATECNCSLGNFDS 160 
QVGACGFNSR I LSFVYP I RLVRVNEDTMELI RGPDGVC I P 200 

210 220 230 240 



CQPGQPGQLVGR I I QGDPLRRFDGYLNQGANNKK I ASDVF 240 
KKGDQAYLTGDVLVMDELGYLYFRDRTGDTFRWKGENVST 280 
TEVEGTLSRLLQMADVAVYGVEVPGAEGRAGMAAVASPTS 320 
NCDLESFAQTLKKELPLYARP I FLRFLPELHKTGTFKFQK 360 
TELRKEGFOPSV VKOPLFYLDARTGCYVALDQEAYTR IQA 400 

410 420 ' 430 440 






mmFATPI full length.DNA 



f 

10 20 30 40 

' ' ' ' t ' ■ ' ' I ' ' ' ' ' ' ' ' ' I ■ ' ' ■ I ■ r t . I i i i i t i . , , 1 

ATGCGGGCTCCTGGAGCAGGAACA6CCTCTGTGGCCTCAC 40 
TGGCGCTGCTTTGGTTTCTGGGACTTCCGTGGACCTGGAG 80 
CGCGGCGGCGGCGTTCTGTGTGTACGTGGGTGGCGGCGGC 120 
TGGCGCTTTCTGCGTATCGTCTGCAAGACGGCGAGGCGAG 160 
ACCTCTTTGGCCTCTCTGTTCTGATTCGTGTTCGGCTAGA 200 

210 220 230 240 

■ ' ' ' I ■ ' ' ' I ' ' ' ' i ■ ■ ' * i ■ ■ ■ t i ■ t t i i i i i i i i i , , [ 

GCTGCGACGACACCGGCGAGCAGGAGACACGATCCCGTGC 240 
ATCTTCCAGGCTGTGGCCCGGCGACAACCAGAGCGCCTGG 280 
CACTGGTGGACGCCAGTAGTGGTATATGCTGGACCTTCGC 320 
ACAGCTGGACACCTACTCCAATGCTGTAGCCAACCTGTTC 360 
CGCCAGCTGGGCTTTGCACCAGGCGATGTGGTGGCTGTGT 400 

410 420 430 440 
» ' ' ' I ■ ' ' ' I ' < ' ■ i < ■ ' » I ■ ' ' ■ i ■ ' ■ ■ 1 ■ ■ ■ ■ ' ■ ■ ■ ■ I 

TCCTGGAGGGCCGGCCGGAGTTCGTGGGACTGTGGCTGGG 440 
CCTGGCCAAGGCCGGTGTGGTGGCTGCTCTTCTCAATGTC 480 
AACCTGAGGCGGGAGCCCCTGGCCTTCTGCCTGGGCACAT 520 
CAGCTGCCAAGGCCCTCATTTATGGCGGGGAGATGGCAGC 560 
GGCGGTGGCGGAGGTGAGCGAGCAGCTGGGGAAGAGCCTC 600 

610 620 630 640 

T I I I I I I 1 I 1 1 1 I I I I I 1 t I II It! I I I I I f i t t I I I t I I 

CTCAAGTTCTGCTCTGGAGATCTGGGGCCTGAGAGCATCC 640 
TGCCTGACACGCAGCTCCTGGACCCCATGCTTGCTGAGGC 680 
GCCCACCACACCCCTGGCACAAGCCCCAGGCAAGGGCATG 720 
GATGATCGGCTGTTTTACATCTATACTTCTGGGACCACCG 760 
GGCTTCCTAAGGCTGCCATTGTGGTGCACAGCAGGTACTA 800 

810 820 830 840 

ill' I l I l l i l l i r 1 i . i ■ I i ■ i ■ I , i ■ i 1 , , ■ i I i , , , I 

CCGCATTGCTGCCTTTGGCCACCATTCCTACAGCATGCGT 840 
GCCGCCGATGTGCTCTATGACTGCCTGCCACTCTACCACT 880 
CTGCAGGGAACATCATGGGTGTGGGGCAGTGCGTCATCTA 920 
CGGGTTGACGGTGGTACTGCGCAAGAAGTTCTCCGCCAGC 960 
CGCTTCTGGGATGACTGTGTCAAGTACAATTGCACGGTAG 1000 

1010 1020 1030 1040. 

■ ' ' ' I ■ ' ■ ' I ' ' ■ ' ' ' ■ ' ■ I ' ' ' ■ i ■ ■ ' ■ I ■ ■ ■ ■ ' ' i ■ ■ I 

TGGATGACATAGGTGAAATCTGCCGCTACCTGCTGAGGCA 1040 
GCCGGTTCGCGACGTGGAGCAGCGACACCGCGTGCGCCTG 1080 
GCCGTGGGTAATGGGCTGCGGCCAGCCATCTGGGAGGAGT 1 120 
TC AC GC AGCGC TTCGGTGTGCC AC AG ATCGGCGAGTTCTA 1 160 
CGGCGCTACCGAGTGCAACTGCAGCATTGCCAACATGGAC 1200 



mmFATPI full length. DNA 



1210 1220 1230 1240 

■ i ■ ■ i ' i ■ ' i ' ■ ■ ■ i ■ ' ■ ' i ' ■ ' ' i > ■ ' ■ i * 1 ' ■ ' ■ ■ ■ ■ i 

GGCAAGGTCGGCTCCTGCGGCTTCAACAQCCGTATCCTCA 1240 
CGCATGTGTACCCCATCCGTCTGGTCAAGGTCAATGAGGA 1280 
CACGATGGAGCCACTGCGGGACTCCGAGGGCCTCTGCATC 1320 
CCGIGCCAGCCCGGGGAACCCGGCCTTCTCGTGGGCC AGA 1360 
TCA'ACCAGCAGGACCCTCTGCGGCGTTTCGATGGTTATGT 1400 

1410 1420 1430 1440 

r i t r I i t r r I f t i ? \ t t f t 1 f t t i 1 t i i i f i i r i I i t i f I 

TAGTGACAGTGCCACCAACAAGAAGATTGCCCACAGCGTT 1440 
TTCCGAAAGGGCGATAGCGCCTACCTCTCAGGTGACGTGC 1480 
TAGTGATGGACGAGCTGGGCTACATGTATTTCCGTGACCG 1520 
CAGCGGGGACACCTTCCGCTGGCGCGGGGAGAACGTGTCC 1560 
ACCACGGAGGTGGAAGCCGTGCTGAGCCGCCTACTGGGCC 1600 

1610 1620 1630 1640 

t I I I 1 t T I I I f I t 1 I t t I I I I t I I I I I I T i t I I 1 f 1 I 1 t 1 

AGACGGACGTGGCTGTGTATGGGGTGGCTGTGCCAGGAGT 1640 
GGAGGGGAAAGCTGGCATGGCAGCCATCGCAGATCCCCAC 1680 
AGCCAGTTGGACCCTAACTCAATGTACCAGGAATTACAGA 1720 
AGGTTCTTGCATCCTATGCTCGGCCCATCTTCCTGCGTCT 1760 
TCTGCCCCAGGTGGATACC ACAGGCACCTTCAAGATCCAG 1800 

1810 1820 1830 1840 

■ * * T \ t t f I I T f T I f T T t t I t f 1 f ! I I 1 I t f t T T I 1 t I I I 

AAGACCCGGCTGCAGCGTGAAGGCTTTGACCCCCGTCAGA 1840 
CCTCAGACAGGCTCTTCTTTCTAGACCTGAAGTCCGGCAC 1880 
GAGGTATCTACCCCTGGATGAGAGAGTCCATGCCCGCATT 1920 
TGCGCAGGCGACTTCTCACTCTGAGCCTGGAGAGTGGGCT 1960 
GGGCCTGGACTCCTGAGACCTGGGAGCCTGAC ACCCCTCT 2000 

2010 2020 2030 2040 

■ t i ■ I i ■ . i I i i ■ i I i i i i I i t i i I i i r t i i i r i I i i i i 1 

TCGGGTGCTTCTCCTGCCTGGCCACATGGACAGCAGCACC 2040 
TGTGAGAGTAGGAAAATGGAACCTGAGTGGCTGGGACCCC 2080 
TCTCCTACTTCCCACTATGCATCCATTTTGCCTCTGCCTT 2120 
GATCTTTTTCTCCATCTCTTTTCTCCCTACCCAGCAGGAG 2160 
CCCCACAAACACATGTTGGCTGCTGTGTCCTGCAGTTGGA 2200 

2210 2220 2230 2240 

t t I i 1 t f f t t i i t t I i t t i j i r i t 1 t i t t I > * t t I i f t ) 1 , , 

CCAGTGTCCAGGGGTACAGGCTTCAGGCTGTGACCCACAC 2240 
TGGTACCCACCTCCCTTTCCTATTTTGCCTf AGGTTCATC 2280 
CACGGTTCCCCTGTGGAGCAAGTGGGGGCCCACATAGCTG 2320 
CTGTCCCTGCTGAGGGTTGGTAGCAATCACACCCTCATGT 2360 
CAGCTGGGAGACACGCGCAGTCTCCCACTGACCCCCAATC 2400 

2410 2420 2430 2440 

AACTGAAAATATTGTTTTGACTACTTTTTGTTTTTTTGTT 2440 
TTTTTGTTTTTTTTTTTTTTCGAGACAGAGTTTCTCTGTA 2480 
TAGCCCTGGCTGTCCTGGAACTCACTTTGTAGACCAGGCT 2520 
GGCCTCGAACTCAAAAATCCTCCTGACTCTGCCTCTGCTT 2560 
CCCAAGTGCTGGGATTAAAGACGTGCGCCACCACCGCCTG 2600 



f(a. 6J.B 



mmFATPI full length.DNA 



2610 2620 2630 2640 

• ■ ■ ' i ■ ' ■ * i t t ■ • i ... i i i i t i i i i > t I ? i t i I i i i t I 

GCTGTTTTQTATTTTTGTTTTGTTTTGACGATAGGGTCTC 2640 
ACTGTGGAGGCCAAGCTGGCCTCAGACTCCCCACCCCATT 2680 
GCCTCTGGGCACCATTCTATATTCTCAGACTGATGACAAT 2720 
GCACTAGTGTCCCTAGGAGTCTTGAGTCTGCACTTTCCCC 2760 
TCATAGCCTCAAGCTTCCAGAACTGACTCTGATCACTTGG 2800 

2810 2820 2830 2840 

, . . . I | I i ■ I ■ r I . I . . ■ I I I ■ I I I I ■ ■ ■ I I ' ' ' I I ' I I 1 

ATGTGGCTAGTGTTGGCTCTACCCACATGTGTCAATTCAG 2840 
GGGTCCCCAGGCATAGTCTCTGGAAGCCCTCACCCGGAAA 2880 
AAGCTTGGAGAGACCCAGGAAGGTTGTTGTGTTCTCTTGG 2920 
GCACCCCCTGGTGGCAGTCCTGGGCATGCTTCCGCACTGT 2960 
ACTGGTGCATATAGCCCAGACCTATGACATTTGAGGTCTA 3000 

3010 3020 3030 3040 

It'll I . t T I 1 .1)1 T f 1 I I l.T.t . I . I I ftTtl ttttl 

CCCTTCTGGCTCCTGTGGTCCCCATTGAGATCCTTGGTGA 3040 
CTCACCTCAGTCACCAAGCAGAGCCTCTGCCTGCCTTCAT 3080 
CTTCAAGGTCATGAAGGATGTGGACAGAGCAGCTACAGGC 3120 
TGCCAGCAGTCAACCACATGAGAGTGTTACTTCCTTGTTG 3160 
GTTTTTAAAAAATAAATGTGCTGAGCCTCGAAAAAAAAAA 3200 

3210 3220 3230 3240 

. . . ■ » < I ■ . I t . . T I I . . . ) . I ■ I I ■ I I I I I I T I 1 1 I I t I 

AAAAAAAAAAAAAAAAA 3217 



mmFATPI fuil length. protein 



10 20 30 40 
. v i i * i i 1 i i 1 > t » t ) i t i- 1 1 t ? * i i 1 t i t l i i t t i i i t i 1 

MRAPGAGTASVASLALLWFLGLPWTWSAAAAFCVYVGGGG 40 
WRFLR I VCKTARRDLFGLSVLI RVRLELRRHRRAGDT IPC 80 
•I FQAVARRQPERLALVDASSG I CWTFAQLDTYSNAVANLF 120 
RQLGFAPGDVVAVFLEGRPEFVGLWLGLAKAGVVAALLNV 160 
NLRREPLAFCLGTSAAKAL I YGGEMAAAVAEVSEQLGKSL 200 

210 220 230 240 

■ ■ ■ ' I I I i I 1 I ■ T I I i I i I I I i l I I . i I i I r i I i I I I I i;l 

LKFCSGDLGPES ILPDTQLLDPMLAEAPTTPLAQAPGKGM 240 
DDRLFY I YTSGTTGlPKAAl VVHSRYYR I AAFGHHSYSMR 280 
AADVLYDCLPLYHSAGN I MGVGQCV I YGLTVVLRKKFSAS 320 
RFWDOCVKYNCTVVDD I GE I CRYLLRQPVRDVEQRHRVRL 360 
AVGNGLRPAIWEEFTQRFGVPQ I GEFYGATECNCS I ANMD '400 

410 420 430 440 

r » I I I I » t T 1 1. I I 1 .t I I t t f t i t » I « t 1 I t I t 1 I I I. I I I 1 

GKVGSCGFNSR ILTHVYP I RLVKVNEDTMEPLROSEGLC I 440 
PCQPGEPGLLVGQ I NQQDPLRRFDGYVSDSATNKK I AHSV 480 
FRKGDSAYLSGDVLVMDELGYMYFRDRSGDTFRWRGENVS 520 
TTEVEAVLSRLLGQTDVAVYGVAVPGVEGKAGMAAI ADPH 560 
SQLDPNSMYQELQKVLASYARP I FLRLLPQVDTTGTFK I Q 600 

610 620 630 640 

* ■ ■ * ' ' ■ ■ ■ i ■ ' ■ ' i ' ' > ' I ■ ' ' ■ ' ' ■ ■ * 1 ' ■ * ' i ' ' ■ ' I 

KTRLQREGFDPRQTSDRLFFLOLKSGTRYLPLDERVHAR I 640 
CAGDFSL 647 



ft*. C3 



mmFATP2full length.PNA 



10 20 30 40 

T 1 T I t t I I \ 1 > t t \ 1 t t I I t 1 1 t I t I ! « 1 1 I ! T I trill 

GGGCGGAGGCCGAGCCCAGTCGCCAGCTCCTGCTCTGCTC 40 
CTCTCCCGCCTGCCGCCGCGCTGCACGCCTCGAGCACTCC 80 
CTCC-GCCCCGGCGGGGACCGGGGACCCCGCAGCTACCGCC 120 
ATGCTGCCAGTGCTCTACACCGGCCTGGCGGGGCTGCTGC 160 
TGCTGCCTCTGCTGCTCACCTGCTGCTGCCCCTACCTCCT 200 

210 220 230 240 

1 i r 1 ) t t f ? I . f t t ? j i ? l I ! I i 1 t 1 t 1 1 1 1 ? f t I 1 ! i t I 1 

CCAAGATGTGCGGTACTTCCTGCGGCTGGCCAACATGGCC 240 
CGGCGGGTGCGCAGCTACCGGCAGCGGCGACCCGTGCGTA 280 
CCATCCTGCGGGCCTTCCTGGAACAAGCGCGCAAGACCCC 320 
ACACAAGCCCTTCCTGCTGTTCCGAGACGAGACGCTCACC 360 
TACGCCCAGGTGGACCGGCGCAGCAACCAAGTGGCGCGGG .400 

410 420 430 440 

■ ■ ' 1 1 * ■ ■ ' t 1 1 1 1 1 1 1 ■ ■ I 1 ' ' ' 1 ' ■ • ' * ■ 1 1 1 1 1 1 1 1 I 

CGCTGCACGATCAACTGGGCCTACGACAGGGGGATTGCGT 440 
AGCCCTCTTCATGGGCAATGAGCCGGCCTACGTGTGGATC 480 
TGGCTGGGACTGCTCAAACTGGGCTGTCCCATGGCGTGCC 520 
TCAACTACAACATTCGTGCCAAGTCTCTGCTGCACTGCTT 560 
TCAATGCTGCGGGGCGAAGGTGCTGCTGGCCTCCCCAGAT 600 

610 620 630 640 

.,1 I T I 1 , ,1 f t 1 I I I 1 I 1 Till) f ,T ? I I Tiff! I 1 I I 1 I \ I T 1 

CTACAAGAAGCTGTGGAGGAGGTTCTTCCAACCCTGAAAA 640 
AGGATGCCGTGTCCGTCTTTTACGTAAGC AGAACTTCTAA 680 
CACAAATGGTGTGGACACAATACTGGACAAAGTAGACGGA 720 
GTGTCGGCGGAACCC ACCCCGGAGTCGTGGAGGTCTGAAG 760 
TCACTTTTACCACGCCAGCAGTATACATTTATACTTCGGG 800 

810 820 830 840 

■ ' ■ ' * ■ * ■ ■ i ' ' ' ■ i ' ' ■ ' i ■ ' ' ' i ' ' < ' i * ' ' ' 1 ■ ■ ' ' i 

AACCACAGGTCTTCCAAAAAGCGGAACCATCAATCATCAT 840 
CGCCTAAGGTATGGGACAAGCCTTGCTATGTCGAGTGGGA 880 
ATCACGGCCAAGGATGTCATCTATACCAACAATGCCCCTG 920 
TTCCAACAGTGCAACGCTCAAGATCGGCCTTCACGGATGC 960 
ATCCTGGGTTGGGGCTACTTTAACCTTGGCGGGGCAAATT 1000 

1010 1020 1030 1040 

' 1 ' ' i ■ ■ * * I ' ' ■ ' i > ' ' ■ I * ■ ■ 1 1 ' ' 1 ■ 1 ' ■ < ■ 1 ' ' ' ■ I — : 

CTCAAGCAAGCCAATTTTGGGAACGACTGGCAGGAAATAC 1040 
AACGTCAACGGTCATTCAGTACATTGGTGAACTGCTTCGG 1080 
TACCTGTGCAACACACCGCAGAAACCAAATGACCGGGACC 1 120 
ACAAAGTGAAAAAAGCCCTGGGAAATGGCTTACGAGGAGA 1 160 
TGTGTGGAGAGAGTTCATCAAGAGATTTGGGGACATCCAC 1200 



F? 4 . GHA 



mmFATP2 full length.DNA 



1210 1220 1230 1240 

T I » 1 f * ' t T 1 ■ 1 t t ! t I T t I I t I T ! t 1 * T 1 t T I T I t t I 1 1 

GTGTATGAGTTCTACGCATCCACTGAAGGCAACATTGGAT 1240 
TTGTGAACTATCCAAGGAAAATCGGTGCTGTCGGGAGAGC 1280 
AAACTACCTACAAAGAAAAGTTGCAAGGTATGAGCTGATC 1320 
AAGTATGACGTGGAGAAGGACGAGCCGGTCCGTGACGCAA 1360 
ATG ? £ATATTGCATCAAAGTCCCCAAAGGTGAGGTTGGACT 1400 

1410 1420 1430 1440 

t • ■ < i ■ ■ ■ i I ' > ' ' I ■ ' ■ ' I ' ' i ' i ' ■ ' ■ I ' ' ' ■ I ' ' ' ' I 

CTTGGTTTGCAAAATCACACAGCTCACACCATTTATTGGC 1440 
TATGCTGGAGGAAAGACCCAGACAGAGAAGAAAAAACTCA 1480 
GAGATGTCTTTAAGAAAGGCGACATCTACTTCAACAGCGG 1520 
AGACCTCCTGATGATCGACCGTGAGAACTTCGTCTACTTT 1560 
CACGACAGGGTTGGAGATACTTTCCGGTGGAAAGGAGAGA 1600 

1610 1620 1630 1640 

■ • < i i ■ ■ i ■ I ■ ' ' ' ' ' ■ ■ 1 I ' 1 ' ■ 1 ' 1 ' ■ 1 ■ 1 ■ 1 1 ' ' ■ ■ I 

ACGTAGCTACCACAGAAGTCGCTGACATCGTGGGACTGGT 1640 
AGATTTTGTTGAAGAAGTGAATGTGTATGGCGTGCCTGTG 1680 
CCAGGTCATGAGGGTCGAATTGGGATGGCCTCCCTCAAGA 1720 
TCAAAGAAAACTACGAGTTCAATGGAAAGAAACTCTTTCA 1760 
ACACATCGCGGAGTACCTGCCCAGTTACGCGAGGCCTCGG 1800 

1810 1820 1830 1840 

t t i i f i t i t I t t f t I f i t t i * r t f I t t i , } 1 1 1 \ 1 r 1 I t ! 

TTCCTGAGGATACAAGATACCATTGAGATCACTGGGACTT 1840 
TTAAACACCGCAAAGTGACCCTGATGGAAGAGGGCTTCAA 1880 
TCCCACAGTCATCAAAGATACCTTGTATTTCATGGATGAT 1920 
GCAGAGAAAACATTTGTGCCCATGACTGAGAACATTTATA 1960 
ATGCCATAATTGATAAAACTCTGAAGCTCTGAATATTCCC 2000 

2010 2020 2030 2040 

i . ■ . t ■ ■ ■ ■ i ■ t > i t i ■ ■ ■ i ' ■ ■ ' ' ■ ■ ■ ■ I < ' ■ ' i ' ' ' ' I 

TGGTGGTTTAGCTCATGACATTTCCAGAAAGAAACTCGAT 2040 
AGACCTCGCAGAGCCACTTCATACGTAGAATCCAACTTTA 2080 
ACTTGATTGAAGAC TATAAGGTGCGATTTTATTTTTAGGA 2120 
AATTATTCATTAAAAGGATAGTTTTTTTTTTTTTT 1 iTAA 2160 
TTACACCTGAACCTTTGCAAGTAAAAAGATTTAGAGACAA 2200 

2210 2220 2230 2240 

... f i ■ i t i l t i < i i i i t t I i ■ i i i i i > i 1 t i i i I i i i i I 

TTATTTTTCAATGTGCACCTGCCATTTGTCCTTGCAAACT 2240 
AAGCTTCTTGGAGAGAGGGCCTTATTTTTTTAAAGACATA 2280 
ATAAACTATATTAACACTAAAAAAAAAAAAAAAAAAAAAA 2320 
AAAAAAAAAAAAAAAAAA 2338 



mmFAT P2 full length.protein 



10 20 30 40 

t I t 1 ! t I I t I t I t t I 1 T 1 1 1 * 1 T 1 \ t I 1 1 I T I T T I I T t ! ( 

I MLPVLYTGLAGlLLtPLLLTCCCPYLLQDVRYFLRLANMA 40 
i, RRVRSYRQRRPVRTILRAFLEQARKTPHKPFLLFRDETLT 80 
! YAQVDRRSNQVARALHOQLGLRQGDCVALFMGNEPAYVWI 120 
WLGILKLGCPMACLNYN I RAKSLLHCFQCCGAKVLLASPD 160 
LQEAVEEVLPTLKKDAVSVFYVSRTSNTNGVDT I LDKVDG 200 

210 220 230 240 

t ? i t T i t l l I i i t I I i i t i 1 f I r i I t i i i 1 i t i r I i t < t \ 

I VSAEPTPESWRSEVTFTTPAVY I YTSGTTGLPKSGT I NHH 240 
RLRYGTSLAMSSGNHG.QGCHLYQQCPCSNSATLK I GLHGC 280 
' ILGWGYFNLGGANSGASQFWERLAGNTTSTV I QY I GELLR 320 
' YLCNTPQKPNDRDHKVKKALGNGLRGDVWREFI KRFGD IH 360 
: VYEFYASTEGN I GFVNYPRK I GAVGRANYLQRKVARYEL I 400 
I 410 420 430 440 

j . i ■ t I ■ I ■ t 1 t ' ■ I I I ' ■ r 1 ■ t . I I I T I 1 I I I I I I I I 1 I I 

! KYDVEKDEPVRDANGYC I KVPKGEVGLLVCK ITQLTPF IG 440 
j YAGGKTQTEKKKLRDVFKKGD I YFNSGDLLM I DRENFVYF 480 
I HDRVGDTFRWKGENVATTEVAD I VGLVOF VEEVNVYGVP V 520 
PGHEGRI GMASLK I KENYEFNGKKLFQH I AEYLPSYARPR 560 
: FLR IQDTI E ITGTFKHRKVTLMEEGFNPTV I KDTLYFMDD 600 

610 620 630 640 

t I I I 1 I I I 1 I 1 ' I T I I I I I I ■ • I T I I I I I I < 1 I I I I I I I I 

i AEKTFVPMTEN I YNA I IQKTLKL. 624 

i 

1 ■ fv«. C5 



mmFATP3 partial.DNA 



10 20 30 40 

GAAAGCTCTGAGAGCGGGTGCAGTCTGGCCTGGCGTCTCG 40 
CGTACCTGGCCCGGGAGCAGCCGACACACACCTTCCTCAT 80 
CCACGGCGCGCAGCGCTTTAGCTACGCGGAGGCTGAGCGC 120 
GAGAGCAACCGGATTGCTCGCGCCTTTCTGCGCGCACGGG 160 
GCTGGACCGGGGGCCGCCGAGGCTCGGGCAGGGGCAGCAC 200 

210 220 230 240 

t i t i 1 ? r i i 1 i i i ? 1 t t i r I t t I t j i i t ? f i i i i ) ■ t t i I , 

TGAGGAAGGCGCACGCGTGGCGCCTCCGGCTGGAGATGCG 240 
GCTGCTAGAGGGACGACCGCGCCCCCTCTGGCACCCGGGG 280 
CGACCGTGGCGCTGCTCCTCCCAGCGGGCCCGGATTTCCT 320 
TTGGATTTGGTTCGGACTGGCCAAAGCTGGCCTGCGCACG 360 
GCCTTTGTGCCCACCGCTTTACGCCGAGGACCCCTGCTGC 400 

410 420 430 440 

■ ' ' ' ' ' 1 1 1 I ■ ■ ■ ■ * ' ■ ' ■ I ' * ■ ' 1 1 > ' 1 i ■ 1 ' ■ 1 1 ■ ' ' i 

ACTGCCTCCGCAGCTGCGGTGCGAGTGCGCTCGTGCTGGC 440 
CACAGAGTTCCTGGAGTCCCTGGAGCCGGACCTGCCGGCC 480 
TTGAGAGCCATGGGGCTCCACCTATGGGCGACGGGCCCTG 520 
AAACTAATGTAGCTGGAATCAGCAATTTGCTATCGGAAGC 560 
AGCAGACCAAGTGGATGAGCCAGTGCCGGGGTACCTCTCT 600 

610 620 630 640 

.' * < ' I * t 1 I I 1 f T T I T I I t t I I 1 T \ 1 t t 1 I I r I f f I I I t 1 

GCCCCCCAGAACATAAT.GGACACCTGCCTGTACATCTTCA 640 
CCTCTGGCACTACTGGCCTGCCCAAGGCTGCTCGAATCAG 680 
TCATCTGAAGGTTCTACAGTGCCAGGGATTCTACCATCTG 720 
TGTGGAGTCCACCAGG AGGACGTGATCTACCTCGCACTCC 750 
CACTGTACCACATGTCTGGCTCCCTTCTGGGCATTGTGGG 800 

810 820 830 840 

' t t > i t i t i I t i i t i 1 t t t t i t t t i i , i T i t t t ) i i , T i i 

CTGCTTGGGCATTGGGGCCACCGTGGTGCTGAAACCCAAG 840 
TTCTCAGCTAGCCAGTTCTGGGACGATTGCCAGAAACACA 880 
GGGTGACAGTGTTCCAGTACATTGGGGAGTTGTGCCGATA 920 
CCTCGTCAACCAGCCCCCGAGCAAGGCAGAGTTTGACCAT 960 
AAGGTGCGCTTGGCAGTGGGCAGTGGGTTGCGCCCAGACA 1000 

1010 1020 1030 1040 

i t * ' 1 i ( i t f tir>) i _ t i i I it t il tittl ittit ittit 

CCTGGGAGCGTTTCCTGCGGCGATTTGGACCTCTGCAGAT 1040 

ACTGGAGACGTATGGCATGACAGAGGGCAACGTAGCTACG 1080 

TTCAATTACACAGGACGGCAGC-GTGCAGTGGGGCGAGCTT 1 120 

CCTGGCTTTACAAGCACATCTTCCCCTTCTCCTTGATTCG 1 160 

ATACGATGTCATGACAGGGGAGCCTATTCGGAATGCCCAG 1200 



fu. 60 A 



mmFATP3 partial.DNA 



1210 1220 1230 1240 

1 f t t t » t f t 1 t i t i t i i i i 1 t ii i 1 i * t i 1 i t i t 1 i i i i I _ 

GGGCACTGCATGACCACATCTCCAGGTGAGCCAGGCCTAC 1240 
TGGTGGCCCCAGTGAGCCAGCAGTCCCCCTTCCTGGGCTA 1280 
TGCTGGGGCTCCGGAGCTGGCCAAGGACAAGCTGCTGAAG 1320 
GATGTCTTCTGGTCTGGGGACGTTTTCTTCAATACTGGGG 1360 
ACC/CTTGGTCTGTGATGAGCAAGGCTTTCTTCACTTCCA 1400 

1410 1420 1430 1440 

> i i t I t i i « f f t i t I t t f i ( r t t l I t t T 1 1 T I I T I ' f T 1 1 

CGATCGTACTGGAGACACCATCAGGTGGAAGGGAGAGAAT 1440 
GTGGCCACAACTC-AAGTGGCTGAGGTCTTGGAGACCCTGG 1 480 
ACTTCCTTCAGGAGGTGAACATCTATGGAGTCACGGTGCC 1520 
AGGGCACGAAGGCAGGGCAGGCATGGCGGCCTTGGCTCTG 1560 
CGGCCCCCGCAGGCTCTGAACCTGGTGCAGCTCTACAGCC 1600 

1610 1620 1630 1640 

■ ■ ■ t ' i i i ' i i ■ ■ ■ i ■ ' ■ ■ i ■ ■ ■ ■ i ■ ' ' ' l ■ <■■ ' ' i ' ' ' ' I 

ATGTTTCTGAGAACTTGCCACCGTATGCCCGACCTCGGTT 1640 
TCTCAGGCTCCAGGAATCTTTGGCCACTACTGAGACCTTC 1680 
AAACAGCAGAAGGTTAGGATGGCCAATGAGGGCTTTGACC 1720 
CCAGTGTACTGTCTGACCCACTCTATGTTCTGGACCAAGA 1760 
TATAGGGGCCTACCTGCCCCTCACACCTGCCCGGTACAG I 1800 

1810 1820 1830 1840 

I I I I 1 ■ I ' < I ■ 1 I t I I I ■ I I I T I . I I I t t [ I I I f I 1 I I I I 

GCCCTCCTGTCTGGAGACCTTCGAATCTGAAACCTTCCAC 1840 
TTGAGGGAGGGGCTCGGAGGGTACAGGCCACCATGGCTGC 1880 
ACCAGGGAGGGTTTTCGGGTATCTTTTGTATATGGAGTCA 1920 
TTATTTTGTAATAAACAGCTGGAGCTTAAAAAAAAAAAAA 1960 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 1 1 998 



ft 4- CC& 



mmFATP3 partial.protein 




ESSESGCSLAWRLAYLAREQPTHTFLIHGAQRFSYAEAER 40 
ESNRI ARAFLRARGWTGGRRGSGRGSTEEGARVAPPAGDA 80 
AARGTTAPPLAPGATVALLLPAGPDFLWIWFGLAKAGIRT 120 
AFVPTALRRGPLLHCLRSCGASALVLATEFLESLEPDLPA 160 
LRAMGLHLWATGPETNVAGISNLLSEAADQVDEPVPGYLS 200 

210 220 230 240 



. APQNI MDTCLYI FTSGTTGLPKAAR ISHLKVLQCQGFYHL 240 
CGVHQEDVI YLALPLYHMSGSLLG I VGCLG I GATVVLKPK 280 
FSASQFWDDCQKHRVTVFQYI GELCRYLVNQPPSKAEFDH 320 
I KVRLAVGSGLRPDTWERFLRRFGPLQI LETYGMTEGNVAT 360 
I FNYTGRQGAVGRASWLYKH I FPFSL I RYDVMTGEP I RNAQ; 400 



GHCMTTSPGEPGLLVAPVSQQSPFLGYAGAPELAKDKLLK 440 
DVFWSGDVFFNTGDLLVCDEQGFLHFHDRTGDT I RWKGEN 480 
VATTEVAEVLETLDFLQEVN I YGVTVPGHEGRAGMAALAL 520 
RPPQALNLVQLYSHVSENLPPYARPRFLRLQESLATTETF 560 
KQQKVRMANEGFDPSVLSDPLYVLDGD I GAYLPLTPARYS 600 
610 620 630 640 



fiit 




■ ■ ■ * 1 < 1 ■ 




440 
I i i i i I 




■ ■ l 



mmFATP4 full length.DNA 



1 10 20 30 40 

, , , , i , . ■ . l i ■ • ' ' ■ ' ■ ' I ■ ■ ' ' I ' ' ' ' I ' ' ' 1 I ' 1 ' ' 1 

ATGCTGCTTGGAGCCTCTCTGGTGGGGGCGCTACTGTTCT 40 
CCAAGCTAGTGCTGAAC-CTGCCCTGGACCCAGGTGGGAT t 80 
CTCCCTGTTGCTCCTGTACTTGGGGTCTGGTGGCTGGCG I 120 
TTCATCCGGGTCTTCATCAAGACGGTCAGGAGAGATATCT 160 
TTGGTGGCATGGTGCTCCTGAAGGTGAAGACCAAGGTGCG 200 

210 220 230 240 

, ■ ■ . I ■ ■ ■ ■ I ' ' ■ ■ 1 T t T I I I I I I I 1 I I I 1 I I I I I I I I 1 1 

ACGGTACCTTCAGGAGCGGAAGACGGTGCCCCTGCTGTTT 240 
GCTTCAATGGTACAGCGCCACCCGGACAAGACAGCCC I GA 280 
TTTTCGAGGGCACAGACACTCACTGGACCTTCCGCCAGC I 320 
- GGATGAGTACTCCAGTAGTGTGGCCAACTTCCTGCAGGCC 360 
CGGGGCCTGGCCTCAGGCAATGTAGTTGCCCTCTTTATGG 400 

410 420 430 440 

, ■ ■ ■ I , . ■ . t . ■ t , I ■ I . ■ I ■ • ■ • I < < < I 1 ' I ' T I ' ' I ' i 

AAAACCGCAATGAGTTTGTGGGTCTGTGGCTAGGCATGGC 440 
CAAGCTGGGCGTGGAGGCGGCTCTCATCAACACCAACCTT 480 
AGGCGGGATGCCCTGCGCCACTGTCTTGACACC I CAAAGG 520 
CACGAGCTCTCATC TTTGGCAGTGAGATGGCCTCAGCTA I 560 
CTGTGAGATCCATGCTAGCCTGGAGCCCACACTCAGCC.TC 600 

610 620 630 640 

.... t i i I i i t ■ i t i i ■ l i t t i i i i i i 1 i ■ t i I i i i i I 

TTCTGCTCTGGATCCTGGGAGCCCAGCACAGTGCCCGTCA 640 
GCACAGAGCATCTGGACCCTCTTCTGGAAGATGCCCCGAA 680 
GCACCTGCCCAGTCACCCAGACAAGGGTTTTACAGATAAG 720 
CTCTTCTACATCTACACATCGGGCACC ACGGGGCTACCCA 760 
AAGCTGCCATTGTGGTGCACAGCAGGTATTATCG I A I GGC 800 

810 820 830 840 

, , ■ . i i . . . I i i ■ < ' ' ' ■ ■ ' ■ ■ ' ' ' < ■ ' ' 1 ' ' ' ■ I ' ' ' 1 1 

TTCCCTGGTGTACTATGGATTCCGCATGCGGCCTGATGAC 840 
ATTGTCTATGACTGCCTCCCCCTCTACCACTCAAGCAGGA 880 
AACATCGTGGGGATTGGCAGTGCTTACTCCACGGCATGAC 920 
TGTGGTGATCCGGAAGAAGTTCTCAGCCTCCCGGTTCTGG 960 
GATGATTGTATCAAGTACAACTGCACAGTGGTACAGTACA 1000 

1010 1020 1030 1040 

■ I 1 , I 1 . . i I r , . i I > < I ■ 1 ■ ' ■ ' ' ' ' ' ' I ' > ' ' I t 1 1 I I 

TTGGCGAGCTCTGCCGCTACCTCCTGAACCAGCCACCCCG 1040 

TGAGGCTGAGTCTCGGCACAAGGTGCGCATGGCACTGGGC 1080 

AACGGTCTCCGGCAGTCCATCTGGACCGACTTCTCCAGCC 1 120 

GTTTCCACATCCCCCAGGTGGCTGAGTTCTATGGGGCCAC 1 160 

TGAATGCAACTGTAGCCTGGGC AACTTTGACAGCCGGGTG 1200 



fr«. CSA 



mm F ATP 4 fuii length. DNA 



1210 1220 1230 1240 

T < f t ! i t i i 1 t t i i I iii;! t i ? ! 1 i t i i 1 t i » i ] t r i i ! 

GGGGCCTGTGGCTTCAATAGCCGCATCCTGTCCTTTGTGT 1240 
ACCCTATCCGTTTGGTACGTGTCAATGAGGATACCATGGA 1280 
ACTGATCCGGGGACCCGATGGAGTCTGCATTCCCTGTCAA 1320 
CCAGGTCAGCCAGGCCAGCTGGTGGGTCGCATCATCCAGC 1360 
AG^ACCCTCTGCGCCGTTTCGACGGGTACCTCAACCAGGG 1400 

1410 1420 1430 1440 

■ i ■ ■ l ■ ■ ■ ■ I ■ ' ■ ' I ■ ' ■ ■ I ■ ■ ■ ■ 1 ■ ' ' ' I ■ ' ■ ' I ' ' < ' ! 

TGCCAACAACAAGAAGATTGCTAATGATGTCTTCAAGAAG 1440 
GGGGACCAAGCCTACCTCACTGGTGACGTCCTGGTGATGG 1480 
ATGAGCTGGGTTACCTGTACTTCCGAGATCGCACTGGGGA 1520 
CACGTTCCGCTGGAAAGGGGAGAATGTATCTACCACTGAG 1560 
GTGGAGGGCACACTCAGCCGCCTGCTTCATATGGCAGATG 1600 

1610 1620 1630 1640 

» ■ ■ ■ i ■ ' ■ 1 I 1 1 ' ' i 1 1 ■ ' I 1 1 ' ■ i ■ 1 ' ' * 1 ' ' ' 1 ' 1 ' 1 1 

TGGCAGTTTATGGTGTTGAGGTGCCAGGAACTGAAGGCCG 1640 
AGCAGGAATGGCTGCCGTTGCAAGTCCCATCAGCAACTGT 1680 
GACCTGGAGAGCTTTGCACAGACCTTGAAAAAGGAGCTGC 1720 
CTCTGTATGCCCGCCCCATCTTCCTGCGCTTCTTGCCTGA 1760 
GCTGCACAAGACAGGGACCTTCAAGTTCCAGAAGACAGAG 1800 

1810 1820 1830 1840 

■ t t t I I r I I I i I l i I ■ ■ i i I t i ■ ■ I i t ■ I I i I I r I i 1 t t I 

TTGCGGAAGGAGGGCTTTGACCCATCTGTTGTGAAAGACC 1840 
CGCTGTTCTATCTGGATGCTCGGAAGGGCTGCTACGTTGC 1880 
ACTGGACCAGGAGGCCTATACCCGCATCCAGGCAGGCGAG 1920 
GAGAAGCTGTGATTTCCCCCTACATCCCTCTGAGGGCCAG 1960 
AAGATGCTGGATTCAGAGCCCTAGCGTCCACCCCAGAGGG 2000 

2010 2020 2030 2040 

■ ' ' i ' ■ ' ■ i I ' ' » ■ i ■ ■ ■ ■ I ' ' ■ ' i ' ■ ■ ' I ' ■ ' ' i ' ' ' > I 

TCCTGGGCAATGCCAGACCAAAGCTAGCAGGGCCCGCACC 2040 
TCCGCCCCTAGGTGCTGATCTCCCCTCTCCC AAACTGCCA 2080 
AGTGACTCACTGCCGCTTCCCCGACCCTCCAGAGGCTTTC 2120 
TGTGAAAGTCTCATCCAAGCTGTGTCTTCTGGTCCAGGCG 2160 
TGGCCCCTGGCCCCAGGGTTTCTGATAGGCTCCTTTAGGA 2200 

2210 2220 2230 2240 

■ i ■ > i ■ i i ■ I i ■ i i i ' ■ i ■ l ' ' ■ ' t ■ ' ' ' I ■ ' ' ' 1 ' ■ ' ■ I 

TGGTATCTTGGGTCCAGCGGGCCAGGGTGTGGGAGAGGAG 2240 
TCACTAAGATCCCTCCAATCAGAAGGGAGCTTACAAAGGA 2280 
ACCAAGGCAAAGCCTGTAGACTCAGGAAGCTAAGTGGCCA 2320 
GAGACTATAGTGGCCAGTCATCCCATGTCCACAGAGGATC 2360 
TTGGTCCAGAGCTGCCAAAGTGTCACCTCTCCCTGCCTGC 2400 

2410 2420 2430 2440 

t I I I I I I I I I 1 ■ I I I I I t 1 I t t I ! I I I I I I I I I I I I I I 1 1 

ACCTCTGGGGAAAAGAGGACAGCATGTGGCCACTGGGCAC 2440 
CTGTCTCAAGAAGTCAGGATCACACACTCAGTCCTTGTTT 2480 
CTCCAGGTTCCCTTGTTCTTGTCTCGGGGAGGGAGGGACG 2520 
AGTGTCCTGTCTGTCCTTCCTGCCTGTCTGTGAGTCTGTG 2560 
TTGCTTCTCCATCTGTCCTAGCCTGAGTGTGGGTGGAACA 2600 



mmFATP4 full length.DNA 



2610 2620 2630 2640 

■ ■ . . i ■ ■ < < ' ' ■ ■ • ' ■ ■ ' ' ' ' ' ■ ■ 1 ■ ' ' 1 1 ' ' ' ' 1 1 1 ■ ' I 

GGCATGAGGAGAGTGTQGCTCAGGGGCCAATAAACTCTGC 2640 
CTTGACTCCTCTTAAAAAAAAAAAAAAAAAAAAAAAAAAA 2680 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 27 1 0 



fiA-C2C 



mmFATP4 full length.protein 




MLLGASLVGALLFSKLVLKLPWTQVGFSLLLLYLGSGGWR 40 
FIRVF IKTVRRD IFGGMVLLKVKTKVRRYLQERKTVPLLF 80 
ASMVQRHPDKTALIFEGTOTHWTFRQLDEYSSSVANFLQA 120 
RGLASGNVVALFMENRNEFVGLWLGMAKLGVEAAL INTNL. 160 
RROALRHCLDTSKARAL I FGSEMASA I CE I HASLEPTLSL 200 



LFYI YTSGTTGLPKAAI VVHSRYYRMASL.VYYGFRMRPDD 280 
I VYDCLPLYHSSRKHRGDWQCLLHGMTVV IRKKFSASRFW 320 
ODC IKYNCTVVQY I GELCRYLLNQPPREAESRHKVRMALG 360. 
NGLRQSIWTDFSSRFH I PQVAEFYGATECNCSLGNFDSRV 400 



GACGFNSR I LSFVYP I RLVRVNEDTMEL I RGPDGVCI PCQ 440 
PGQPGQLVGR1 I QQDPLRRFDGYLNQGANNKK I ANOVFKK 480 
GDQAYLTGOVLVflOELGYLYFRDRTGDTFRWKGENVSTTE 520 
VEGTLSRLLHMADVAVYGVEVPGTEGRAGMAAVASP I SNC 560 
DLESFAQTLKKELPLYARP i FLRFLPELHKTGTFKFQKTE 600 

610 620 630 640 



LRKEGFDPSVVKOPLFYLOARKGCYVALDQEAYTRIQAGE 640 






I EKL. 644 




mmFAT P5 full length.DNA 



f 10 20 30 40 

* I t t t T 1 1 T ! * T I 1 ! 1 t t 1 f t I 1 t I 1 1 t I f t ! t t ? f . ( t i 

CACTCATCAGAGCTAAGAGAGACTACACGCTCTCATCTAC 40 
TTCAGAAAGAGCCAATGCCATGGGTATTTGGAAGAAACTA 80 
ACCTTACTGCTGTTGCTGCTTCTGCTGGTTGGCCTGGGGC 120 
AGCCCCCAT.GGCCAGCAGCTATGGCTCTGGCCCTGCGTTG 160 
GTTCCTGGGAGACCCCACATGCCTTGTGCTGCTTGGCTTG 200 

210 220 230 240 
■ ' ■ ' i i < i t 1 i i i i i i t i i l i i ■ ■ i i ■ i i I ■ i i i i i i i i l 

GCATTGCTGGGCAGACCCTGGATCAGCTCCTGGATGCCCC 240 
ACTGGCTGAGCCTGGTAGGAGCAGCTCTTACCTTATTCCT 230 
ATTGCCTCTACAGCCACCCCCAGGGCTACGCTGGCTGCAT 320 
AAAGATGTGGCTTTCACCTTCAAGATGCTTTTCTATGGCC 360 
TAAAGTTCAGGCGACGCCTTAACAAACATCCTCCAGAGAC 400 

410 420 430 440 

t i t i I r f t i \ J i I t t i i i i 1 I I , t t f f i r i I t i I i j trill 

CTTTGTGGATGCTTTAGAGCGGCAAGCACTGGCATGGCCT 440 
GACCGGGTGGCCTTGGTGTGTACTGGGTCTGAGGGCTCCT 480 
CAATCACAAATAGCC'AGCTGGATGCCAGGTCCTGTCAGGC 520 
AGCATGGGTCCTGAAAGCAAAGCTGAAGGATGCCGTAATC 560 
CAGAACACAAGAGATGCTGCTGCTATCTTAGTrCTCCCGT 600 

610 620 630 ''640 

' ■ ' > I ' ' ' ' I ■ ' ' ■ i ' ' < ' I ' ■ * 1 i ' ■ 1 ■ I ' ■ ■ ' i ■ ' ' ' l 

CCAAGACCATTTCTGCTTTGAGTGTGTTTCTGGGGTTGGC 640 
CAAGTTGGGCTGCCCTGTGGCCTGGATCAATCCACACAGC 680 
CGAGGGATGCCCTTGCTACACTCTGTACGGAGCTCTGGGG 720 
CCAGTGTGCTGATTGTGGATCCAGACCTCCAGGAGAACCT 760 
GGAAGAAGTCCTTCCCAAGCTGCTAGCTGAGAACATTCAC 800 

810 820 830 840 

_ * f > t I ittri iT»il iTtil > i I. . t f I t I . > 1 ffff? ffffl 

TGCTTCTACCTTGGCCACAGCTCACCC ACCCCGGGAGTAG 840 
AGGCTCTGGGAGCTTCCCTGGATGCTGCACCTTCTGACCC 880 
AGTACCTGCCAGCCTTCGAGCTACGATTAAGTGGAAATCT 920 
CCTGCCATATTCATCTTTACTTCAGGGACCACTGGACTCC 960 
CAAAGCCAGCCATCTTATCACATGAGCGGGTCATACAAGT 1000 

1010 1020 1030 1040 

' ■ ' ' i ■ ' ■ ' I ' ' ' ■ 1 ' > ' ' * ' ' ■ ■ 1 1 ■ ■ ' I ■ ■ ■ ■ 1 ■ ' ■ ■ I 

GAGCAACGTGCTGTCCTTCTGTGGATGCAGAGCTGATGAT 1040 
GTGGTCTATGACGTCCTACCTCTGTACCATACGATAGGGC 1080 
TTGTCCTTGGATTCCTTGGCTGCTTACAAGTTGGAGCCAC 1 120 
CTGTGTCCTGGCCCCCAAGTTCTCTGCCTCCCGATTCTGG 1 160 
GCTGAGTGCCGGCAGCATGGCGTAACAGTGATCTTGTATG 1200 



fr«. 7oA 



mmFATP5 full length.DNA 



1210 1220 1230 1240 

T . , , i i . ■ ■ i i i ' ■ i - i i ■ 1 ■ ' ■ ' 1 ' ■ ' ' l ' 1 ' ' ' ■ ■ 1 1 I — 

TGGGTGAAATCCTGCGGTACTTGTGTAACGTCCCTGAGCA 1240 
ACCAGAAGACAAGATACATACAGTGCGCTTGGCCATGGGA 1280 
ACTGGACTTCGGGCAAATGTGTGGAAAAACTTCCAGCAAC 1320 
GCTTTGGTCCCATTCGGATCTGGGAATTCTACGGATCCAC 1360 
AGAGGGCAATGTGGGCTTAATGAACTATGTGGGCCACTGC 1400 

'' 1410 1420 1430 1440 

. ■ ■ ■ i i ■ ■ . 1 t ■ • ■ i ■ ■ ■ ' I ■ ' ' ' I ' i ' ' I ' ' ' ' I ' ' ' ' I 

GGGGCTGTGGGAAGGACCAGCTGCATCCTTCGAATGCTGA 1440 
CTCCCTTTGAGCTTGTACAGTTCGACATAGAGACAGCAGA 1480 
GCCTCTGAGGGACAAACAGGGTTTTTGCATTCCTGTGGAG 1520 
CCAGGAAAGCCAGGACTTCTTTTGACCAAGGTTCGAAAGA 1560 
ACCAACCCTTCCTGGGCTACCGTGGTTCCCAGGCCGAGTC 1600 

1610 1620 1630 1640 

. . ■ . I I . ■ » I I 1 t . I ■ r t i I I I T T I l I I I I r 1 1 I 1 I 1 I I I 

CAATCGGAAACTTGTTGCGAATGTACGACGCGTAGGAGAC 1640 
CTGTACTTCAACACTGGGGACGTGCTGACCTTGGACCAGG 1680 
AAGGCTTCTTCTACTTTCAAGACCGCCTTGGTGACACCTT 1720 
CCGGTGGAAGGGCGAAAACGTATCTACTGGAGAGGTGGAG 1760 
TGTGTTTTGTCTAGCCTAGACTTCCTAGAGGAAG I CAATG 1800 

1810 1820 1830 1840 

. . , , I i i i i I ■ i ■ i I i i i ■ I ' < i < I i t t . ! I I 1 1 \ I I 1 1 1 . 

TCTATGGTGTGCCTGTGCCAGGGTGTGAGGGTAAGGTTGG 1840 
CATGGCTGCTGTGAAACTGGCTCCTGGGAAGACTTTTGAT 1880 
GGGCAGAAGCTATACCAGCATGTCCGCTCCTGGCTCCCTG 1920 
CCTATGCCACACCTCATTTCATCCGTATCCAGGATTCCC i 1960 
GGAGATCACAAACACCTACAAGCTGGTAAAGTCACGGCTG 2000 

2010 2020 2030 2040 

.... i . . .. l .... i ... i lii ii I it ii 1 ii ii I i i i i I 

GTGCGTGAGGGTTTTGATGTGGGGATCATTGCTGACCCCC 2040 
TCTACATACTGGACAACAAGGCCCAGACCTTCCGGAGTCT 2080 
GATGCCAGATGTGTACCAGGCTGTGTGTGAAGGAACCTGG 2120 
AATCTCTGACCACCTAGCCAACTGGAAGGCAATCCAAAAG 2160 
TGTAGAGATTGACACTAGTC AGCTTCACAAAGTTGTCCGG 2200 

2210 2220 2230 2240 

. , , , t t . t i I > i t t \ i * i i i * ' i ' I ' i ' ' I r l 1 f 1 I T 1 I I 

GTTCCAGATGCCCATGGCCC AGTAGTACTTAGAGAATAAA^2240 
CTTGAATGTGTATACAAAAAAAAAAAAAAAAAAAAAA 22/7 



fu. 7o B 



mmFATP5 fufljength.protem 



10 20 30 40 

•{ t „ t„ 1 ) I t < 1 1 t t t l j t } \ i I t T 1 > I t f r t I I I f T 1 t t T ( ! 

MGIWKKLTLLLLLLLLVGLGGPPWPAAMALALRWFLGDPT 40 
CLVLLGLALLGRPW I SSWMPHWLSLVGAALTLFLLPLQPP 80 
PGLRWLHKDVAFTFKMLFYGLKFRRRLNKHPPETFVQALE 120 
RQALAWPDRVALVCTGSEGSS I TNSQLDARSCQAAWVLKA 160 
KLKDAV I QNTRDAAA I LVLPSKT I SALSVFLGLAKLGCPV 200 

210 220 230 240 

'] i t t I » i J i I ' T t f. 1 1 f T f 1 „ i } 1 if T , T IE 1 1 ■■ T , „ .L ,J...i..-, i . * 1 1 f j , , 



AWINPHSRGMPLLHSVRSSGASVL I VDPDLQENLEEVLPK 240 
LLAEN IHCFYLGHSSPTPGVEALGASLDAAPSOPVPASLR 280 
ATIKWKSPAI F IFTSGTTGLPKPAILSHERV IQVSNVLSF 320 
CGCRADDVVYDVLPLYHT I GLVLGFLGCLQVGATCVLAPK 360 
FSASRFWAECRQHGVTV I IYVGEILRYLCNVPEQPEOK IH 400 

410 420 ' 430 440 

< 1 ■ 1 1 1 ' ■ ' * ' ■ 1 1 1 1 1 1 ' I ' 1 ' ' 1 ' ' ' 1 ¥ 1 ' 1 ' 1 1 ' 1 1 I 



TVRLAMGTGLRANVWKNFQQRFGP IR IWEFYGSTEGNVGL 440 
MNYVGHCGAVGRTSC I LRMLTPFELVQFD I ETAEPLRDKQ 480 
GFCIPVEPGKPGLLLTKVRKNQPFLGYRGSQAESNRKLVA 520 
NVRRVGDLYFNTGOVLTLDQEGFFYFQORLGDTFRWKGEN 560 
VSTGEVECVLSSLDFLEEVNVYGVPVPGCEGKVGMAAVKL 600 

610 620 630 640 
■ ■ ■ ■ i ■ ■ ' ' I ' ■ ■ ' i ■ < ' * I ■ ' ■ ' i ■ ' ■ ' I ' ' ' ■ i ' ' < ■ I 



APGKTFDGQKLYQHVRSWLPAYATPHFIR IQDSLE I TNTY 640 
KLVKSRLVREGFDVGI I ADPLY ILDNKAQTFRSLMPDVYQ 680 
AVCEGTWNL. 690 



dmFATP partial.DNA 



10 20 30 40 

I f t t ! ItTlf Till! 1 I f I I TfTl) ( f j f I I I 1 I 1 

GCTCTCTGGGCCTATATCAAGCTGCTGAGGTACACGAAGC 40 
GCCATGAGCGGCTCAACTACACGGTGGCGGACGTCTTCGA 80 
ACGAAATGTTCAGGCCCATCCGGACAAGGTGGCTGTGGTC 120 
AGTGAGACGCAACGCTGGACCTTCCGTCAGGTGAACGAGC 160 
ATGCGAACAAGGTGGCCAATGTGCTGCAGGCTCAGGGCTA 200 

210 220 230 240 

' ' * ' t ' ' 1 * t * * ' ' 1 ' i * ' I * ' ' ' I i t i t 1 r r * ? I t t i t ! 

CAAAAAGGGCGATGTGGTGGCCCTGTTGCTGGAGAACCGC 240 
GCCGAGTACGTGGCCACCTGGCTGGGTCTCTCCAAGATCG 280 
GTGTGATCACACCGCTGATCAACACGAATCTGCGCGGTCC 320 
CTCCCTGCTGCACAGCATCACGGTGGCCCATTGCTCGGCT 360 
CTCATTTACGGCGAGGACTTCCTGGAAGCTGTCACCGACG 400 

410 420 430 440 

I I I t * T T 1 T f T T I ! I ! I r T [ 1 1 T r i T t ! I I f ! T t I f 1 I t I 

TGGCCAAGGATCTGCCAGCGAACCTCACACTCTTCCAGTT 440 
CAACAACGAGAACAAC AACAGCGAGACGGAAAAGAACATA 480 
CCGCAGGCCAAGAATCTGAACGCGCTGCTGACCACGGCCA 520 
GCTATGAGAAGCCTAACAAGACGCAGGTTAACCACCACGA 560 
CAAGCTGGTCTACATCTACACCTCCGGCACCACAGGAT-TG 600 

610 620 630 640 

i 1 r i \ i f t i I ( i i r { i i i i 1 r f t i t i i t t f t t r r 1 I l i * I 

CCAAAGGCTGCGGTTATCTCTCACTCCCGTTATCTGTTTA 640 
TCGCTGCTGGCATCCACTACACCATGGGTTTCCAGGAGGA 680 
GGACATCTTCTACACGCCCTTGCCTTTGTACCACACCGCT 720 
GGTGGCATTATGTGCATGGGTCAGTCGGTGCTCTTTGGCT 760 
CCACGGTCTCCATTCGCAAGAAGTTCTCGGCATCCAACTA 800 

810 820 830 840 

j > t t f r i * t I rrTtl tiflj Iflff ftltl TtttT tifti 

TTTCGCCGACTGCGCCAAGTATAATGCAACTATTGGTCAG 840 
TATATCGGTGAGATGGCTCGCTACATTCTAGCTACGAAAC 880 
CCTCGGAATACGACCAGAAACACCGAGTGCGTCTGGTCTT 920 
TGGAAACGGACTGCGACCGCAGATTTGGCCACAGTTTGTG 960 
CAGCGCTTCAACATTGCCAAGGTTGGCGAGTTCTACGGCG 1000 

1010 1020 1030 1040 

* t t t I I ■ » t I » * I t I I t r T I I I T I I 1 f I t 1 f I f t 1 I t t t ? 

CCACCGAGGGTAATGCGAACATCATGAATCATGACAACAC 1040 

GGTGGGCGCCATCGGCTTTGTGTCGCGCATCCTGCCCAAG 1080 

ATCTACCCAATCTCGATCATTCGCGCCGATCCGGACACCG 1 120 

GAGAGCCCATTAGAGATAGGAATGGCCTATGCCAACTGTG 1 160 

CGCTCCCAACGAGCCAGGCGTATTCATCGGCAAGATCGTC 1200 



dmFATP partial.DNA 



1210 1220 1230 1240 
■ ' ' ' I ' ' ' ' 1 < ' ■ ' i ■ ■ ■ » i ■ ■ ■ i i i ■ . i I t . . i i . ■ , , | 

AA AG G A AA TCC T TC TC GC G A AT TCCTCGG AT AC GTCGATG 1240 

AAAAGGCCTCCGCGAAGAAGATTGTTAAGGATGTGTTCAA 1280 

GCATGGCGATATGGCTTTCATCTCCGGAGATCTGCTGGTT 1320 

GOCGACGAGAAGGGTTATCTGTACTTCAAGGATCGCACCG 1360 

GTGACACCTTCCGCTGGAAGGGCGAGAATGTTTCCACCAG 1400 

1410 1420 1430 1440 
' ' ' ' I ' ' ' ' I ' ' ' ' i > ■ i i I ■ » ■ . i ■ . . . I ■ ■ . . i . ... i 

CGAGGTGGAGGCGCAAGTCAGCAATGTGGCCGGTTACAAG 1 440 
GATACCGTCGTTTACGGCGTAACCATTCCGCACACCGAGG 1 480 
GAAGGGCCGGCATGGCCGCCATCTATGATCCGGAGCGAGA 1520 
ATTGGACCTCGACGTCTTCGCCGCTAGCTTGGCCAAGGTG 1560 
CTGCCCGCGTACGCTCGTCCCCAGATCATTCGATTGCTCA 1600 
1610 1620 1630 ' 1640 

I I I t I I I > T I T I I t I I I t I I I . I ■ ) i , , , I , , , , | , , , , 1 

CCAAGGTGGACCTGACTGGAACCTTTAAGCTGCGCAAGGT 1640 
AGACCTGCAGAAGGAGGGCTACGATCCGAACGCGATCAAG 1680 
GACGCGCTGTACTACCAGACTTCCAAGGGTCGGTACGAGC 1720 
TGCTCACGCCCCAGGTTTACGACCAGGTGCAGCGCAACGA 1760 
AATCCGCTTCTAAGAGCTGCAATAGAGTTGTGTCTGAACC 1800 
1810 1820 1830 1840 

' 1 ' ' I r t t i I i i i i 1 t i i i I i.i. l , , , , I , , , , I , . , . I 

TTGCCTTTTGCCCAATATGCTGTTAATTAGTTTGTAAGGC 1840 
TAAGTGi AGTAGAGGAAAATCGGGGGAAATCGGCAGCAAA 1880 
GATCATl CAGCCTAGGAGAGATGCATCCGAAGCACATTTC 1920 
CATGTCAACAATGCACTTTTGTATATCGTAAGCATATATA 1960 
TA I CGTATATCGTAAACGTAGTTGTATCTGCATTTGTGTA 2000 

2010 2020 2030 2040 
1 ' ■ ' 1 ' ' ' ' I ' ' ■ ■ < ' ■ ' ' t ■ ' ■ ■ i ■ ■ ■ t i ■ i ■ i i i . ■ . I 

GATGATAGCCTCCTATACGCATTTCAATTGTTTTTAGCGT 2040 
GC TAAAGAACCTTGTTAAATGCAATTTCAGCTATTGTTTA 2080 
GTCAGTTTTAGTGGCATTTACACTTCCATTCTCGTTGCGT 2120 
TTCGT l TTTGCCTGTACATATGAGAAGCTCTGATGTTTTT 2160 
GTATCAAATAAAGTTTTTTCCTTCACCACGGACCACGTGA 2200 

2210 2220 2230 2240 

1 1 1 ' 1 1 1 1 1 1 ' ' ' ' 1 ' 1 ' ' 1 ■ ■ ■ • 1 ■ ' ■ i ' ■ ' ■ i i . ■ . . i 

AAAAAAAAAAAAAAAAAAAAA 2221 



dmFATP partial.protein 



10 20 30 40 

■ ■ ■ ' i ■ ' ■ * I ■ ' ■ ■ i ' ' ■ ' I ■ ' ' ■ i ' ■ ■ ■ I ' ■ ' ■ i * ■ ■ ■ I 

ALWAYIKLLRYTKRHERLNYTVADVFERNVQAHPDKVAVV 40 
SETQRWTFRQVNEHANKVANVLQAQGYKKGDVVALLLENR 80 
. AEYVATWLGLSK IGV I TPLI NTNLRGPSLLHSITVAHCSA 120 
r. L I YGEDFLEAVTDVAKDLPANLTLFGFNNENNNSETEKN I 160 
I; PQAKNLNALLTTASYEKPNKTQVNHHDKLVY I YTSGTTGL 200 

j'j 210 220 230 240 

I t I t I 1 f f I 1 T. 1 I.. I t I 1 ! I I T I 1 1 I I I I 1 I I I t T ! . 1 t 1 t I 

• PKAAVISHSRYLF1 AAGIHYTMGFGEEDIFYTPLPLYHTA 240 
GG I MCMGQSVLFGSTVS IRKKFSASNYFADCAKYNATIGQ 280 
Y IGEMARY ILATKPSEYDQKHRVRLVFGNGLRPQ IWPQFV 320 
QRFNI AKVGEFYGATE-GNANI MNHDNTVGAI GFVSR ILPK 360 
I YPIS I IRADPDTGEP I RDRNGLCQLCAPNEPGVF I GK I V 400 

410 420 430 440 

■ ■ ' ■ 1 ■ < ' ■ I < ■ ' i i ■ i ■ ' I ■ ■ . i i t . i i I ... i i t . i i ) 

KGNPSREFLGYVDEKASAKK I VKDVFKHGOMAF I SGDLLV 440 
' ADEKGYLYFKDRTGDTFRWKGENVSTSEVEAQVSNVAGYK 480 

DTVVYGVTIPHTEGRAGMAAI YDPERELDLDVFAASLAKV 520 
'; LPAYARPQ I IRLLTKVDLTGTFKLRKVDLQKEGYDPNAIK 560 
! DALYYQTSKGRYELLTPQVYDQVGRNEIRF 590 



i 
I 



ft. 73 



f 10 20 30 40 

T ■ ■ i i ■ ' « i 1 i i ■ i i t ■ ■ ■ I ■ ' ■ ' i < 1 ■ 1 I ' 1 1 * i 1 ' ' 1 I 

AGTGTAGATACCACAGGAACGTTTAAAATCCAGAAGACCA 40 
GACTGCAAAGGGAAGGATACGATCCACGGCTCACAACTGA 80 
CCAGATCTACTTCCTAAACTCCAGAGCAGGGCGTTACGAG 120 
CTTGTCAACGAGGAGCTGTACAATGCATTTGAACAAGGGC 160 
AGGATTTCCCTTT 173 



drFATP partial.protein 



10 20 30 40 

i i i i 1 » * i i I i i i t f t t r t I f t f r J t i f f I i r r > I i i i t 1 

SVDTTGTFK IQKTRLQREGYDPRLTTOQ I YFLNSRAGRYE 40 
LVNEELYNAFEQGQDFP 57 




ceFATPa coding only.DNA 



f 10 20 30 40 

. ■ ■ ■ i ■ i ■ ■ I ■ - ' ■ i ■ < ' ■ 1 « ' ' ' ' ' ■ ' ' I ' ' ' ' 1 ' 1 ' '■ ! 

ATGAAGCTGGAGGAGCTTGTGACAGTTATGCTTCTCACAG 40 
TGGCTGTCATTGCTCAGAATCTTCCGATTGGAG I AATATT 80 
GGCTGGAGTTCTTATTTTATACATCACAGTGGTTCATGGA 120 
GATTTCATTTATAGAAGTTATCTTACGTTGAA t AGGGATT 160 
TAACAGGATTGGCTCTAATTATTGAAGTCAAAATCGACCT 200 

210 220 230 240 

f . . . I r ■ . . I ■ ' r I I ■ I I t ! r I ■ I I I I I I 1 1 1 I I I I 1 T ! I 

ATGGTGGAGGTTGCATCAGAATAAAGGAATCCATGAACTG 240 
TTTTTGGATATTGTGAAAAAGAATCCAAATAAGCCGGCGA 280 
TGATTGACATCGAGACGAATACAACAGAAACATACGCAGA 320 
GTTCAATGCACATTGTAATAGATATGCCAATTATTTCCAG 360 
GGTCTTGGCTATCGATCCGGAGACGTTGTCGCCTTGTACA 400 

410 420 430 440 

■ ■III I I I ■ I 1 I T I I 1 I ■ I I till! I I I I I I 1 I 1 t I I I 1 1 

TGGAGAACTCGGTCGAGTTTGTGGCCGCGTGGATGGGACT 440 
CGCAAAAATCGGAGTTGTAACGGCTTGGATCAACTCGAAT 480 
TTGAAAAGAGAGCAACTTGTTCATTGTATCAC I GCGAGCA 520 
AGACAAAGGCGATTATCACAAGTGTAACAC I I CAGAA l AT 560 
TATGCTTGATGCTATCGATCAGAAGCTGTTTGATGTTGAG 600 

610 620 630 640 

■ . ■ . i . i ■ ■ i • ' ■ - 1 ■ < • ' 1 ■ ■ ' ■ ' ' ' 1 1 I 1 1 1 1 1 ' ' ' 1 I 

GGAATTGAGGTTTACTCTGTCGGAGAGCCCAAGAAGAATT 640 
CTGGATTCAAGAATCTCAAGAAGAAGTTGGA i GCTCAAAT 680 
TACTACGGAACCAAAGACCCTTGACATAGTAGATTl_TAAA 720 
AGTATTCTTTGCTTCATCTATACAAGTGGTACTAC I GGAA 760 
TGCCAAAAGCCGCTGTCATGAAGCACTTCAGATATTACTC 800 

810 820 830 840 

■ • ■ < I i i ■ ■ ! t t i i I ■ ■ i i I t i t i I i i i i 1 i i i i I i ■ i i I 

GATTGCCGTTGGAGCCGCAAAATCATTCGGAATCCGCCCT 840 
TCTGATCGTATGTACGTCTCGATGCCAATTTATCACACTG 880 
CAGCTGGAATTCTTGGAGTTGGGCAAGCTCTGTTGGGlGG 920 
ATCATCGTGTGTCATTAGAAAAAAATTCTCGGC I AGCAAC 960 
TTTTGGAGGGATTGTGTAAAGTATGATTGTACAGTTTCAC 1000 

1010 1020 1030 1040 

■ ■■■I . i ■ . I i i i ■ i . i ■ i l i i i i I i i t i I i i i i I i i i t I 

AATACATTGGAGAGATTTGTCGGTACTTGTTGGCTCAGCC 1040 
AGTTGTGGAAGAGGAATCCAGGCATAGAATGAGATTGTTG 1080 
GTTGGAAACGGACTCCGTGCTGAAATCTGGCAACCATTTG 1 120 
TAGATCGATTCCGTGTCAGAATTGGAGAACT i TATGGTTC 1 160 
AACTGAAGGAACTTCATCTCTCGTGAACATTGACGGACA I 1200 



Fr*%A 



ceFATPa coding only.DNA 



1210 1220 1230 1240 

, . t . I . ■ < ■ I t ■ . I I I I I t I ■ I « I 1 1 ' t ' I ' T I I I 1 I I I I 

GTCGGAGCTTGCGGATTCTTGCCAATATCCCCATTAACAA 1240 
AGAAAATGCATCCGGTTCGATTAATTAAGGTTGATGATGT 1280 
CACTGGAGAAGCAATCCGAACTTCCGATGGACTTTGCATT 1320 
GCATGTAATCCAGGAGAGTCTGGAGCAATGGTGTCGACGA 1360 
TCAQAAAAAATAATCCATTATTGCAATTCGAGGGATATCT 1 400 

1410 1420 1430 1440 

■ . . ■ 1 ■ ■ t I 1 I I ■ ■ 1 . ■ ■ ■ I ■ ' I I I t I T I I 1 I I I I 1 I I I 1 

GAATAAGAAGGAAACGAATAAAAAGATTATCAGAGATGTC 1 440 
TTCGCAAAGGGAGATAGTTGCTTTTTGACTGGAGATCTTC 1480 
TTCATTGGGATCGTCTTGGTTATGTATATTTCAAGGATCG 1520 
TACTGGAGATACTTTCCGTTGGAAGGGAGAGAATGTGTCG 1560 
ACTACTGAAGTCGAGGCAATTCTTCATCCAATTACTGGAT 1600 

1610 1620 1630 1640 

• ■ ■ ■ t . ■ ■ ■ 1 ■ ■ ■ ■ 1 ■ ■ ' ' ! 1 ' ■ ■ 1 ' 1 1 ■ I ' ' ' ' 1 1 1 1 1 I 

TGTCTGATGCAACTGTTTATGGTGTAGAGGTTCCTCAAAG 1640 
AGAGGGAAGAGTTGGAATGGCGTCAGTTGTTCGAGTTGTA 1680 
TCGCATGAGGAAGATGAAACTCAATTTGTTCATAGAGTTG 1720 
GAGCAAGACTTGCCTCTTCGCTTACCAGCTACGCGATTCC 1760 
TCAGTTTATGCGAATTTGTCAGGATGTTGAGAAAACAGGT 1800 

1810 1820 1830 1840 

.... i i ... t i ■ i. i i ' t ■ I t < i i > i t i i 1 i i i i I i i i i t 

ACATTCAAACTTGTGAAGACGAATCTACAACGATTAGGTA 1840 
TCATGGATGCTCCTTCAGATTCAATTTACATCTACAATTC 1880 
TGAAAATCGCAATTTTGTGCCGTTCGACAATGATTTGAGG 1920 
TGCAAGGTCTCACTGGGAAGTTATCCATTTTAA 1953 



ceFATPa coding onl y.protein 



10 20 30 40 

i , t . t i , . i I i i i t i ■ i i i i ■ ■ ■ ' t ■ ■ ■ ■ l ' ' ' ' i ' ' ■ ' I 
MKLEELVTVMLLTVAV I AQNLP I GV I LAGVL I LY I TVVHG 40 
OFI YRSYLTLNRDLTGLAL I I EVK I DLWWRLHQNKG I HEL 80 
FLD I VKKNPNKPAM I D I ETNTTETYAEFNAHCNRYANYFQ 120 
I GLGYRSGOVVALYMENSVEFVAAWMGLAKIGVVTAWINSN 160 
j LKREQLVHC I TASKTKAI I TSVTLQN I MLDAI DQKLFDVE 200 
' 210 220 230 240 

■ till t t i I 1 t I » t t t I I t I i I I ■ I I » I 1 I 1 I I 1 I 1 I I I I 

G I EVYSVGEPKKNSGFKNLKKKLDAQ ITTEPKTLD i VDFK 240 
S ILCF I YTSGTTGMPKAAVMKHFRYYS I AVGAAKSFGI RP 280 
SDRMYVSMP I YHTAAG ILGVGQALLGGSSCV I RKKFSASN 320 
FWRDCVKYDCTVSQY I GE I CRYLLAQPVVEEESRHRMRLL 36,0 
VGNGLRAE IWQPFVDRFRVR I GELYGSTEGTSSLVN I DGH 400 
410 420 430 440 

. t t ■ I i i i i 1 i i ■ i I i i ■ . I i i i i I t i i i I i r i i [ 1 I t I I 

'■ VGACGFLP ISPLTKKMHPVRLIKVDDVTGEAI RTSDGLC I 440 
ACNPGESGAMVST I RKNNPLLQFEGYLNKKETNKK I IRDV 480 
FAKGDSCFLTGOLLHWDRLGYVYFKDRTGDTFRWKGENVS 520 
TTEVEAILHP I TGLSDATVYGVEVPQREGRVGMASVVRVV 560 
SHEEDETQFVHRVGARLASSLTSYAIPQFMR ICQOVEKTG 600 

610 620 630 640 

. ■ . i i ■ i ■ i l ■ ■ ■ ■ i i ■ ■ ■ i ■ ■ ' * i ■ > ' ' I ' ' ■ ' I ' ■ ' ' I 

' TFKLVKTNLQRLGI MQAPSDS I Y I YNSENRNFVPFDNDLR 640 
! CKVSLGSYPF. 651 



ceFATPb coding only.DNA 



10 20 30 40 

, . . i i . ■ ■ < I i ■ ■ > I . i ■ . I i i t i i i » i t 1 i i i i I i i i i ! 

ATGAGGGAAATGCCGGACAGTCCCAAGTTTGCGTTAGTCA 40 
CGTTTGTTGTGTATGCAGTGGTTTTGTACAATGTCAACAG 80 
CGTTTTCTGGAAATTTGTATTCATCGGATATGTTGTATT I 120 
AGGCTGCTTCGCACTGATTTTGGAAGAAGAGCACTTGCCA 160 
CGTTACCTAGAGATTTTGCGGGACTGAAGCTCTTAATA ! C 200 

210 220 230 240 

t i i r ( > > t t ! i i i i 1 iitil i t t t I t i i t I i ! r l 1 T t l I I 

GGTTAAGTCGACAATTCGTGGCTTGTTCAAGAAAGATCGC 240 
CCAATTCATGAAATCTTTTTGAATCAGGTGAAACAGCATC 280 
CAAACAAAGTGGCGATTATTGAAATTGAAAGTGGTAGGCA 320 
GTTGACGTATCAAGAATTGAATGCGTTAGCTAATCAGTAT 360 
GCTAACCTTTACGTGAGTGAAGGTTACAAAATGGGCGACG 400 

410 420 430 440 

. . t ■ i • ■ t i I i t i i i . i ■ i I i i i i i t i i i ] i i i i l i < i i I 

TTGTCGCTTTGTTTATGGAAAATAGCATCGACTTCTTTGC 440 
AATTTGGCTGGGACT-TTCCAAGATTGGAGTCGTGTCGGCG 480 
TTCATCAACTCAAACTTGAAGTTGGAGCCATTGGCACAT i 520 
CGATTAATGTTTCGAAGTGCAAATCATGCATTACCAATAT 560 
CAATCTGTTGCCGATGTTCAAAGCCGCTCGTGAAAAGAAT 600 

610 620 630 640 

■ ■ t 1 I .... I t 1 I I I I ! I T I I 1 1 I I I I 1 I I I 1 I 1 I I 1 I I I 

CTGATCAGTGACGAGATCCACGTGTTTCTGGCTGGAACTC 640 
AGGTTGATGGACGTCATAGAAGTCTTCAGCAAGATCTCCA 680 
TCTTTTCTCTGAGGATGAACCTCCAGTTATAGACGGAC I C 720 
AATTTTAGAAGCGTTCTGTGTTATATTTACACTTCCGGTA 760 
CTACCGGAAATCCAAAGCCAGCCGTCATTAAACACTTCCG 800 

810 820 830 840 

. . i ■ i i • ■ . I i ■ • ■ i ■ i ■ i I i ■ ■ ■ i ' ' ' ' I ' ' ' ' I ' ' ' i I 

TTACTTCTGGATTGCGATGGGAGCAGGAAAAGCATTTGGA 840 
ATTAATAAGTCAGACGTTGTGTACATTACGATGCCAATGT 880 
ATCACTCTGCCGCCGGTATCATGGGTATTGGATCATTAAT 920 
TGCATTCGGGTCGACCGCTGTTATTAGGAAAAAGTTTTCG 960 
GCAAGCAACTTCTGGAAAGATTGCGTCAAGTACAACGTCA' 1000 

1010 1020 1030 1040 

i . . ■ i i i t . I t < i i i i ■ < < I ■ i ■ ' i ' ' ■ ' I 1 ■ ■ ' 1 1 ' 1 1 I 

CAGCGACACAGTACATTGGAGAAATCTGC AGGT ATCITCT 1040 
GGCAGCGAATCCATGTCCTGAAGAGAAACAACACAACG I G 1080 
CGATTGATGTGGGGAAATGGTTTGAGAGGACAAATTTGGA 1 120 
AAGAGTTTGTAGGAAGA-TTTGGAATTAAGAAAATTGGAGA 1 160 
GTTGTACGGCTCAACAGAAGGAAACTCCAATATTGTTAAC 1200 



1210 1220 1230 1240 

I t T I t I t r t I t I T 1 1 t I 1 I j I I T 1 1 I I T T I t ( t I 1 t 1 I f t 

GTGGATAACCATGTTGGAGCTTGTGGATTCATGCCAATTT 1240 
ATCCCCATATTGGATCCCTCTACCCAGTTCGACTTATTAA 1280 
GGTTGATAGAGCCACTGGAGAGCTTGAACGTGATAAGAAC 1320 
GGACTCTGTGTGCCGTGTGTGCCTGGTGAAACTGGGGAAA 1360 
TGGTTGGCGTTATC AAGGAGAAAGATATTCTTCTAAAGTT 1 400 

1410 1420 1430 1440 

1 T T I f 1 I 1 T ! T T I I F I t I T I t 1 I 1 I 1 1 1 I I t I I I 1 I I t 1 I 

CGAAGGATATGTCAGCGAAGGGGATACTGCAAAGAAAATC 1 440 
TACAGAGATGTGTTCAAGCATGGAGATAAGGTGTTTGCAA 1480 
GTGGAGATATTCTTCATTGGGATGATCTTGGATACTTGTA 1520 
CTTTGTGGACCGTTGTGGAGACACTTTCCGTTGGAAAGGG 1560 
GAGAACGTGTCAACTACTGAAGTTGAGGGAATTCTTCAGC 1600 

1610 1620 1630 1640 

i < t * 1 , r , | , f , i f < t i i ! I , r t f I ttttl ititl r t t f I Tiitl 

CTGTGATGGATGTGGAAGATGCAACTGTTTATGGAGTCAC 1640 
TGTCGGTAAAATGGAGGGGCGTGCCGGAATGGCTGGTATT 1680 
GTCGTCAAGGATGGAACGGATG-TTGAGAAATTCATCGCCG 1720 
ATATTACTTCTCGACTGACCGAAAATCTGGCGTCTTACGC 1760 
AATCCCTGTTTTCATTCGGCTGTGCAAGGAAGTTGATCGA 1800 

1810 1820 1830 1840 

■ ' ' ' i ■ ' ' ■ I ■ ' ' ' i ■ ■ ■ ' I ■ ■ ' ■ i ■ ' ' ' I ' ■ ■ ■ i i ' ■ ' I 

ACCGGAACCTTCAAACTCAAGAAGACTGATCTTCAAAAAC 1840 
AAGGTTACGACCTGGTTGCTTGTAAAGGAGACCCAATTTA 1880 
CTACTGGTCAGCTGCAGAAAAATCCTACAAACCACTGACT 1920 
GACAAAATGCAACAGGATATTGACACTGGTGTTTATGA.TC 1960 
GCATTTAA 1968 



fi 4 .7?8 



ceFATPb coding only.protein 



10 20 30 40 

* * 1 » I t T t t I f I T > f I I X 1 1 I T r * I f 1 t I 1 I I t 1 1 f f T I f 

MREMPDSPKFALVTFVVYAVVLYNVNSVFWKFVF IGYVVF 40 
RLLRTDFGRRALATLPRDFAGLKLL I SVKST I RGLFKKOR 80 
P IHE IFLNQVKQHPNKVA I I EI ESGRQLTYQELNALANQY 120 
ANLYVSEGYKMGDVVALFMENS I OFFAI WLGLSK IGVVSA 160 
F I NSNLKLEPLAHS I NVSKCKSC I TN INLLPMFKAAREKN 200 

210 220 230 240 

''''I r t r r I r t t r 1 t t r i 1 » i t i ! i r i t j i i t i t t t r r I 



LISDE IHVFLAGTQVQGRHRSLQQDLHLFSEDEPPV I DGL 240 
NFRSVLCY I YTSGTTGNPKPAV I KHFRYFWI AMGAGKAFG 280 
INKSDVVY ITMPMYHSAAG I MG I GSL I AFGSTAV I RKKFS 320 
ASNFWKDCVKYNVTATQY I GE I CRYLLAANPCPEEKQHNV 360 
RLMWGNGLRGQ I WKEFVGRFG IKK IGELYGSTEGNSNI VN 400 

410 420 430 4'40 
■ ■ * ' 1 ■ ■ ■ ■ I * » > ■ i ■ i i i I ■ i i t i i i i i 1 i i i i i i i . ■ i 



VONHVGACGFMPI YPH I GSLYPVRL I K VDRATGELERDKN 440 
GLCVPCVPGETGEMVGV I KEKD I LLKFEGYVSEGDTAKK I 480 
YRDVFKHGDKVFASGD I LHWDDLGYLYFVDRCGDTFRWKG 520 
ENVSTTEVEG I LQPVMDVEDATVYGVTVGKNEGRAGMAG I 560 
VVKDGTDVEKFI AO I TSRLTENLASYA IP VF IRLCKEVDR 600 

610 620 630 640 
' ' ' ■ i ' ' ' ■ I ' ■ ' ■ i » ' ■ ' I ' ' ' ' * ' ' ' ' I ' ■ ' > ' ■ ' * ■ I 



TGTFKLKKTDLQKQGYDLVACKGDP I YYWSAAEKSYKPLT 640 
OKMQGD IOTGVYDR I. 656 



chFATP coding only.DNA 



f 10 20 30 40 

■ ■ . . 1 i T . . I ■ . t ■ I . . . i I i ■ t 1 I l I I t I I I \ I 1 I I [ I 1 

ATGGCGTGTATGCATCAGGCTCAGCTATACAATGATCTAG 40 
AGGAATTGCTAACTGGTCCATCAGTACCCATCGTTGCTGG 80 
AGCTGCTGGAGCTGCAGCTCTCACTGCCTACATTAACGCC 120 
AAATACCACATAGCCCATGATCTCAAGACCCTCGGTGGTG 160 
GATTGACACAATCGTCCGAAGCGATTGATTTCATAAACCG 200 

21,0 220 230 240 

.... i .... I t i .. i t ■ .. l t ■ i . i i i i i 1 t f i i I i i i i I 

CCGCGTCGCACAAAAGCGCGTCCTCACGCACCACATCTTC 240 
CAGGAGCAGGTCCAAAAACAATCAAATCATCCCTTTCTTA 280 
TCTTTGAGGGCAAGACATGGTCTTACAAGGAGTTCTCTGA 320 
GGCATACACGAGGGTCGCGAACTGGCTGATTGATGAGC l G 360 
GACGf ACAAGTAGGGGAGATGGTCGCAATTGATGGCGGAA 400 

410 420 430 -440 

t . i i i i i i i I t i i t i i i > i I i i i i l. i i i t 1 i i i i 1 i t t i 1 

ATAGTGCAGAGCACCTGATGCTTTGGCTTGCACTTGATGC 440 
AATCGGTGCGGCTACGAGTTTTTTGAACTGGAACCTGACA 480 
GGGGCAGGGTTAATTCATTGCATAAAGCTATGCGAATG I C 520 
GATTCGTTATCGCAGACATCGATATTAAAGCGAACATTGA 560 
ACCGTGCCGTGGCGAACTGGAGGAGACGGGCATCAACATT 600 

610 620 630 640 

. . . i i i . t i l i i i ■ i t i i t I i i i t i t i i i I i i t i I i t i i t 

CACTACTATGACCCATCCTTCATCTCATCGCTACCGAATA 640 
ACACGCCAATTCCCGACAGCCGCACTGAGAACAT 1 GAA ! T 680 
AGATTCAGTACGAGGACTGATATACACATCTGGAACCACT 720 
GGTCTACCTAAAGGCGTGTTTATAAGCACTGGCCGCGAGC 760 
TTAGGACTGACTGGTCGATTTCAAAGTATCTAAA I CTCAA 800 

810 820 830 840 

■ . . . i . . ■ ■ I ■ t . . i . . ■ . I ■ ■ ■ ■ i < ■ ■ i I i t i i I t i i i ! 

GCCCACGGATCGAATGTATACATGTATGCCGCTCTACCAT 840 
GCCGCTGCACACAGCCTCTGTACAGCATCAGTTATTCATG 880 
GTGGAGGTACCGTGGTATTGAGCAGGAAATTCTCACACAA 920 
GAAGTTCTGGCCTGAAGTTGTGGCTTCGGAAGCAAATATC 960 
ATTCAGTACGTTGGTGAATTAGGTCGATATCTCCTGAATG 1000 

1010 1020 1030 1040 

■ ■ . ■ ) . . ■ ■ I ■ r . . I i ■ ■ ■ I ■ ■ i t 1 t < ' < I t ■ ' t 1 I I I I 1 

GTCCAAAGAGTCCTTACGACAGGGCCCATAAAGTCCAGAT 1040 
GGCGTGGGGCAAT'GGCATGCGTCCAGACGTGTGGGAAGCG 1080 
TTTCGTGAACGCTTCAACATACCAATTATTCATGAGCTCT 1 120 
ATGCCGCAACCGATGGGCTCGGGTCAATGACCAATCGTAA 1 160 
CGCGGGCCCTTTTACAGCAAACTGTATTGCGCTGCGAGGG 1200 



chFATP coding only.DNA 



1210 1220 1230 1240 

■ ■ i t ! ■ t i i ! i i i < I i i ■ i I i ■ ' < I ■ i ■ i I i > i t 1 i i i i 1 

CTGATCTGGCACTGGAAATTTCGAAATCAGGAAGTGCTGG 1240 
TCAAGATGGATCTCGATACTGATGAGATCATGAGAGATCG 1280 
CAATGGGTTTGCGATACGATGCGCTGTCAATGAACCTGGA 1320 
CAGATGCTTTTTCGGCTGACACCCGAAACTCTGGCTGGTG 1360 
CACCAAGCTACTACAACAACGAAACGGCCACACAGAGCAG 1400 

4 1410 1420 1430 1440 

\ t * , f i t r . 1 t f i i ! i i t t ! f i t i I i r t i 1 t r i t I t i I I ! 

GCGGATTACAGATGTGTTTCAAAAGGGTGACCTGTGGTTC 1440 
AAGTCCGGTGACATGCTACGGCAAGACGCCGAAGGCCGCG 1480 
TCTACTTTGTCGATCGACTAGGCGATACGTTCCGCTGGAA 1520 
ATCCGAAAACGTTTCTACCAATGAAGTCGCGGACGTGATG 1560 
GGCACATTTCCTCAGATTGCTGAAACGAATGTATACGGTG 1600 

1610 1620 1630 1640 

* i i 1 I i t t f I r « f r 1 f l 1 l I i i t i I t t T r \ r t r i 1 t t I I 1 

TCCTTGTGCCGGGTAACGATGGTCGAGTGCGCAGCCTCAA 1640 

TTGTCATGGCAGACGGCGTGACAGAGTCGACATTCGCTTC 1680 

GCTGCCCTTGCAAAGCACGCCCGAGATCGGTTACCGGGTT 1720 

ATGCTGTACCACTGTTTCTGAGGGTAACTCCAGCACTTGA 1760 

ATATACGGGCACATTAAAGATTCAGAAAGGACGCCTCAAG 1800 

1810 1820 1830 1840 

f r r I 1 t i } t 1 f t t i I i t I T \ t t \ i 1 f l 1 f 1 t t i i t i i t i 1 

CAGGAAGGTATAGACCCAGATAAGATTTCCGGCGAAGATA 1840 

AGTTATACTGGCTGGCGCCTGGTAGCGATATATATTTACC 1880 

ATTTGGAAAGATGGAGTGGCAGGGAATTGTAGATAAGCGT 1920 
ATACGGCTGTGA 1932 



chFATP coding only.protein 



/ 

10 20 30 40 

1 i ■ I I t i i I I ? i « i ! i r i i 1 i I i I I trill t 1 I I I 1 I 1 I I 

MACMHQAQLYNDLEELLTGPSVP I VAGAAGAAALTAY INA 40 
KYH I AHDLKTLGGGLTQSSEA I OF I NRR VAQKRVLTHH I F 80 
QEQVQKQSNHPFLIFEGKTWSYKEFSEAYTRVANWLIDEL 120 
DVQVGEMVAI DGGNSAEHLMLWLALDAI GAATSFLNWNLT 160 
GAGL I HC 1 KLCECRFV I AD ID I KAN I EPCRGELEETG INI 200 
210 220 230 240 

■ ■ ■ i I t ■ ■ ■ I < ■ ■ ' ' ' t ■ < 1 r . ■ . I I . . I I I I I 1 \ I I ! I I 

HYYDPSF I SSLPNNTP I PDSRTEN I ELDSVRGL I YTSGTT 240 
GLPKGVF I STGRELRTDWSISKYLNLKPTDRMYTCMPLYH 280 
AAAHSLCTASV IHGGGTVVLSRKFSHKKFWPEVVASEANI 320 
I QYVGELGRYLLNGPKSPYDRAHKVQMAWGNGMRPDVWEA 360 
FRERFNIP 1 1 HELYAATDGLGSMTNRNAGPFTANC I ALRG 400 
410 420 430 440 

■ ■ ' ' I I ■ I « I I 1 I I 1 1 T I I I I ■ 1 1 I I I T 1 I t I I I I I I I 1 1 

L I WHWKFRNQEVLVKMDLDTDE I MRDRNGFA I RCAVNEPG 440 
QMLFRLTPETLAGAPSYYNNETATQSRR I TDVFQKGDLWF 480 
KSGDMLRQDAEGRVYFVDRLGDTFRWKSENVSTNEVADVM, 520 
GTFPQ I AETNVYGVLVPGNDGRVRSLNCHGRRRDRVD I RF' 560 
AALAKHARDRLPGYAVPLFLRVTPALEYTGTLK I QKGRLK 600 

610 620 630 640 

■ ■ ■ ■ i > ■ ■ ■ I ■ ■ < ' ' ' ' < ' ' ' ' ' ' I ■ ' ' ■ I ' ' ' ' I ' ' ■ ' 1 

QEG I DPDK I SGEDKLYWLPPGSD I YLPFGKMEWQG I VDKR 640 
IRL 643 



ft,. «| 

0> 



aspergillus partial.DNA 



10 20 30 40 

■ , . / 1 , ■ ■ ■ i i . ■ ■ i < ' ' ■ ' ■ ' ■ ' ' ' ■ ' ■ I ' ■ ' ' i ' ' ' ' 1 

CTTTACCATTCATCAGCTTCATTCTGCATTTTTAGCTTGA 40 
CGGCAGCCGGGTCTACGCTGATCATCGGCCGCAAGTTCTC 80 
CGCGAGAAACTTCATAAAGGAAGCGCGCGAGAACGACGCC 120 
ACGGTCATCCAGTACGTGGGTGAGACCTTGCGATATCTGC 160 
TCGCCACCCCCGGTGAAACCGATCCAGTTACTGGCGAAGA 200 

210 220 230 240 

. , . ■ I . . ■ ■ I r I ■ ■ 1 ■ ■ t < I i ■ i ' 1 t I 1 I I ? I I I I I I 1 1 1 

CCTGGACAAAAAGCACAATATTCGAGCAGTATACGGCAAC 240 
GGGCTACGGCCGGATATCTGGAACCGCTTCAAGGAGCGCT 280 
TCAACGTGCCGACGGTTGCCGA.ATTTTATGCTGCAACCGA 320 
GAGCCCAGGCGGAACATGGAACTATTCAACAAATGACTTC 360 
ACTGCCGGAGCCATTGGGCACACTGGCGTGCTTAGTGGAT 400 

410 420 430 440 

i.i.l I I I I 1 I I r I I t I I < I I i i I I I I t I I 1 T I ! 1 I I 1 I I 

GGCTTCTTGGACGCGGCCTTACTATTGTCGAGGTGGACCA 440 
GGAATCACAGGAACCATGGCGCGATCCCCAAACCGGGTTC 480 
TGCAAGCCGGTCCCGCGAGGCGAAGCAGGCGAGCTCCTGT 520 
ATGCCATTGATCCGGCCGACCCGGGCGAGACCTTCCAGGG 560 
CTACTACCGCAACTCCTTTAGAGCACACTGGCGGCCG 597 




aspergil'lu s partia l.protein 




f T 1 ] 




LYHSSASFC IFSLTAAGSTLI I GRKFSARNF I KEARENOA 40 
TV I QYVGETLRYLLATPGETDPVTGEDLDKKHN I RAVYGN 80 
GLRPD IWNRFKERFNVPTVAEFYAATESPGGTWNYSTNDF 120 
TAGAIGHTGVLSGWLLGRGLTI VEVDQESQEPWRDPQTGF 160 
CKPVPRGEAGELLYAIDPADPGETFQGYYRNSFRAHWRP 199 




- i 



10 20 30 40 

■ ' ' ■' ' 1 ■ * ' I 1 1 ' ' 1 ' ' ■ 1 I ' - ■ 1 ' ' ' ■ ■ I ■ < ' ' 1 ' ' ' ■ I 
GCAAAGGCCGACGCGTGGCTGCGGACGGGTAACGTGATCA 40 
GGGCGGACAACGAAGGGCGACTCTTCTTCCACGACCGGAT 80 
CGGAGACACGTTCCGATGGAAGGGAGAGACNGTCAGCACA 120 
CAAGAGGTCAGTTTGGTGCTCGGACGACACGACTCAATCA 160 
AGGAGGCCAACGTGTACGGCGTGACGGTGCCGAACCACGA 200 

210 220 230 240 

r t t | , t , L , t r t 1 t f 1 i I iiitl r ] t i | i i t i | t ] i t f t r t t 1 

CGGGCGGGCCGGCTGCGCTGCGCTCACGCTATCAGACGCT 240 
CTGGCGACTGAAAAGAAGCTGGGCGATGAGCTGCTAAAGG 280 
GATTGGCTACTCACTCGTCGACTTCGCTTCCCAAGTTTGC 320 
GGTGCCGCAGTTCCTACGGGTGGTGCGCGGCGAGATGCAG 360 
TCAACGGGCACCAACAAGCAACAGAAGCACGACCTGAGGG 400 

410 420 430 440 

■t t t I f t t i r f i i t f f i i t t f e..f f f f f t t f 1 f r f f ! i i i 1 I 

TGCAGGGTGTAGAGCCGGGCAAGGTGGGCGTAGACGAGGT 440 
GTACTGGTTGCGGGGAGGGACATATGTACCATTCGGAACA 480 
GAGGATTGGGATGGGTTGAAGAAGGGTC T TGTGAAGTTGT 520 
GA 522 



Ft,- H 



mqFATP partial.protein 



10 20 30 40 

l > 1 I I 1 1 I 1 t I 1 1 1 * * 1 t I I I T 1 I T t 1 1 1 1 I I 1 1 I I 1 1 1 

AKADAWLRTGNV I RADNEGRLFFHDR I GDTFRWKGETVST 40 
QEVSLVLGRHDS I KEANVYGVTVPNHDGRAGCAALTLSDA 80 
LATEKKLGDELLKGLATHSSTSLPKFAVPQFLRVVRGEMQ 120 
STGTNKQQKHOLRVQGVEPGKVGVDEVYWLRGGTYVPFGT 150 
EDWDGLKKGLVKL 173 



scFATP coding oniy.DNA 



10 20 30 40 

^ I t I t t I I 1 .T 1 I I 1 1 T 1 I 1 I t t I I t I I t t I I 1 1 f I 1 I I I 

ATGTCTCCCATACAGGTTGTTGTCTTTGCCTTGTCAAGGA 40 
TTTTCCTGCTATTATTCAGACTTATCAAGCTAAT7ATAAC 80 
CCCTATCCAGAAATCACTGGGTTATCTATTTGGTAATTAT 120 
TTTGATGAATTAGACCGTAAATATAGATACAAGGAGGATT 160 
GGTATATTATTCCTTACTTTTTGAAAAGCGTGTTTTGTTA 200 

210 220 230 240 

■ • ' ■ i ■ ' ■ ' 1 ' ' ■ * 1 ■ * ■ ■ I ' ■ ■ ' i ' ■ ■ ' I ■ ' ■ ■ i ■ ' ' > l 

TATCATTGATGTGAGAAGACATAGGTTTCAAAACTGGTAC 240 
TTATTTATTAAACAGGTCCAACAAAATGGTGACCATTTAG 280 
CGATTAGTTACACCCGTCCCATGGCCGAAAAGGGAGAATT 320 
TCAACTCGAAACCTTTACGTATATTGAAACTTATAACATA 360 
GTGTTGAGATTGTCTCATATTTTGCATTTTGATTATAACG 400 

410 420 430 440 

' t i i f i f i i I i ? t t ? t i i t f t i t f t r f i f f i i t i ? ? f > ; ! 

TTCAGGCCGGTGACTACGTGGCAATCGATTGTACTAATAA 440 
ACCTCTTTTCGTATTTTTATGGCTTTCTTTGTGGAACATT 480 
GGGGCTATTCCAGCTTTTTTAAACTATAATACTAAAGGCA 520 
CTCCGCTGGTTCACTCCCTAAAGATTTCCAATATTACGCA 560 
GGTATTTATTGACCCTGATGCCAGTAATCCGATCAGAGAA 600 

610 620 630 '640 

f f | 1 I I I 1 1 1 j i l . ■* 1 I ] I f I f ! I ^ 1 t I ! \ j f I t t f T 1 I [ , i 

TCGGAAGAAGAAATCAAAAACGCACTTCCTGATGTTAAAT 640 
TAAACTATCTTGAAGAACAAGACTTAATGCATGAACTTTT 680 
AAATTCGCAATCACCGGAATTCTTACAACAAGACAACGTT 720 
AGGACACCACTAGGCTTGACCGATTTTAAACCCTCTATGT 760 
TAATTTATACATCTGGAACCACTGGTTTGCCTAAATCCGC 800 

810 820 830 840 

' ■ ■ ' i ■ 1 1 1 1 ' ' ' 1 1 ■ 1 ' * I ' ■ ' 1 1 ■ ' 1 1 I 1 ■ 1 ■ 1 ■ 1 ' ' ! 

TATTATGTCTTGGAGAAAATCCTCCGTAGGTTGTCAAGTT 840 
TTTGGTCATGTTTTACATATGACTAATGAAAGCACTGTGT 880 
TCACAGCCATGCCATTGTTCCATTCAACTGCTGCCTTATT 920 
AGGTGCGTGCGCCATTCTATCTCACGGTGGTTGCCTTGCG 960 
TTATCGCATAAATTTTCTGCCAGTACATTTTGGAAGCAAG 1000 

1010 1020 1030 1040 

* ^ ^ f ^ Ij??! IfTfl 1 f I t T 1 I f T 1 tfltf Eft?! Ilffl t 

TTTATTTAACAGGAGCCACGCACATCCAATATGTCGGAGA 1040 

AGTCTGTAGATACCTGTTACATACGCCAATTTCTAAGTAT 1080 

GAAAAGATGCATAAGGTGAAGGTTGCTTATGGTAACGGGC 1 120 

TGAGACCTGACATCTGGCAGGACTTCAGGAAGAGGTTCAA 1 160 

CATAGAAGTTATTGGTGAATTCTATGCCGCAACTGAAGCT 1200 



scFATP coding only.DNA 



1210 1220 1230 1240 

, ■ ■ . i • ■ ■ i i ■ ■ ' ■ ' t ■ . ■ i ■ ■ i ■ i » ■ ■ i I i i i i 1 i t i t I 

CCTTTTGCTACAACTACCTTCCAGAAAGGTGACTTTGGAA 1240 
TTGGCGCATGTAGGAACTATGGTACTATAATTCAATGGTT 1280 
TTTGTCATTCCAACAAACATTGGTAAGGATGGACCCAAAT 1320 
GACGATTCCGTTATATATAGAAATTCCAAGGGTTTCTGCG 1360 
AAGTGGCCCCTGTTGGCGAACCAGGAGAAATGTTAATGAG 1400 

/ 1410 1420 1430 1440 

. ■ . I I . ■ ■ ' I ■ ' ■ ■ I ' ■ ' t I I ■ 1 I I 1 1 T I I I T I I I I T I I I 

AATCTTTTTCCCTAAAAAACCAGAAACATCTTTTCAAGGT 1440 
TATCTTGGTAATGCCAAGGAAACAAAGTCCAAAGTTGTGA 1 480 
GGGATGTCTTCAGACGTGGCGATGCTTGGTATAGATGTGG 1 520 
AGATTTATTAAAAGCGGACGAATATGGATTATGGTATTTC 1 560 
CTTGATAGAATGGGTGATACTTTCAGATGGAAATCTGAAA 1600 

1610 1620 1630 1640 

f r } r f t r i i \ i i t t 1 * t t t 1 i 1 i t f i t t I I i i t I 1 l 1 } t 1 

ATGTTTCCACTACTGAAGTAGAAGATCAGTTGACGGCCAG 1640 
TAACAAAGAACAATATGCACAAGTTCTAGTTGTTGGTATT 1680 
AAAGTACCTAAATATGAAGGTAGAGCTGGTTTTGCAGTTA 1 720 
TTAAACTAACTGACAACTCTCTTGACATCACTGCAAAGAC 1760 
CAAATTATTAAATGATTCCTTGAGCCGGTTAAATCTACCG 1800 

1810 1820 1830 1840 

. . i i i i t t ■ l i ■ i ' i ' i i > 1 ■ ' ■ ' ' ' ■ ' ' I ' ■ ' < 1 * ' ' * I 

TCTTATGCTATGCCCCTATTTGTTAAATTTGTTGATGAAA 1840 
TTAAAATGACAGATAACCTCATAAAATTTTGA 1872 



scFATP coding only.protein 



10 20 30 40 

■ ■ » < t » ■ ' ' * ■ ' ' ■ ' 1 ' ■ 1 i ■ ■ ' ■ 1 1 ' ' ' i ' * ■ ■ 1 ■ ' ■ ■ i 

MSP IQVYVFALSRIFLLLFRL IKLI I TP I QKSLGYLFGNY 40 
FQELDRKYRYKEOWY I IPYFLKSVFCYI I QVRRHRFQNWY 80 
LF IKQVQQNGDHLAI SYTRPMAEKGEFQLETFTY IETYN I 120 
VLRLSH ILHFDYNVQAGDYVA IQCTNKPLFVFLWLSLWN I 160 
GAIPAFLNYNTKGTPLVHSLK ISNITQVF5DPDASNP IRE 200 

210 220 230 240 

'y i i i I t f * * 1 i * t i f t t i i I i i i t 1 f i r t I i i 1 t f i t i i 1 

SEEEIKNALPOVKLNYLEEQDLMHELLNSQSPEFLQQDNV 240 
RTPLGLTDFKPSMl I YTSGTTGLPKSA I MSWRKSSVGCQV 280 
FGHVLHMTNESTVFTAMPLFHSTAALLGACA I LSHGGCLA 320 
LSHKFSASTFWKQVYLTGATH IQYVGEVCRYLLHTP I SKY 36Q 
EKMHKVKVAYGNGLRPD I WQOFRKRFN I EV I GEFYAATEA 400 

410 420 430 440 

. 1 I 1 1 ^ « i t t 1 j I i t ( liiil ittfl t $ r i ] t i i t f t t t t ! 

PFATTTFQKGDFGI GACRNYGT I I QWFLSFQQTLVRMDPN 440 
DDSV I YRNSKGFCEVAPVGEPGEMLMR IFFPKKPETSFQG 480 
YLGNAKETKSK VVRDVFRRGDAWYRCGDLLKADEYGLWYF 520 
LDRMGDTFRWKSENVSTTEVEDQLTASNKEQYAQVLVVG! 560 
KVPKYEGRAGFAV I KLTDNSLD I TAKTKLLNDSLSRLNLP 600 

610 620 630 640 

■ ■ ■ ' i ■ ■ ■ ' 1 ■ ■ ■ ' < ' < ' > I ■ ■ < ■ i < ■ < ' I ' ■ ■ ■ ' ' ' ■ > I 

SYAMPLFVKFVOEI KMTDNL I KF. 624 




mtFATP coding oniy.DNA 



/ 10 20 30 40 

■ ■ . . I i i i . I • i < > I i i ■ i ! i ■ ' ■ l ' ■ ' ' 1 ■ ' ' ■ I ' ' ' ' 1 

GTGTCCGATTACTACGGCGGCGCACACACAACGGTCAGGC 40 
TGATCGACCTGGCAACTCGGATGCCGCGAGTGTTGGCGGA 80 
CACGCCGGTGATTGTGCGTGGGGCAATGACCGGGCTGC I G 120 
GCCCGGCCGAATTCCAAGGCGTCGATCGGCACGGTGTTCC 160 
AGGACCGGGCCGCTCGCTACGGTGACCGAGTCTTCCTGAA 200 

210 220 230 240 

■ > i t I t > i t I ■ t i i 1 i ■ ■ i 1 t i i < 1 i i i i I i i i t 1 tiiit 

ATTCGGCGATCAGCAGCTGACCTACCGCGACGCTAACGCC 240 
ACCGCCAACCGGTACGCCGCGGTGTTGGCCGCCCGCGGCG 280 
TCGGCCCCGGCGACGTCGTTGGC ATCATGTTGCGTAACTC 320 
ACCCAGCACAGTCTTGGCGATGCTGGCCACGGTCAAGTGC 360 
GGCGCTATCGCCGGCATGCTCAACTACCACCAGCGCGGCG 400 ' 

410 420 430 440 

■ . I I I T I I I I I ■ ■ I I I 1 ■ ' I ' ' ' ' I ' ' ' ' I ' ' ' ' I ■ ' ' ' I 

AGGTGTTGGCGCACAGCCTGGGTCTGCTGGACGCGAAGGT 440 

ACTGATCGCAGAGTCCGACTTGGTCAGCGCCGTCGCCGAA 480 

TGCGGCGCCTCGCGCGGCCGGGTAGCGGGCGACGTGCTGA 520 

CCGTCGAGGACGTGGAGCGATTCGCCACAACGGCGCCCGC 560 

CACCAACCCGGCGTCGGCGTCGGCGGTGCAAGCCAAAGAC 600 

610 620 630 640 

I t i t 1 I t I t I I I I t I trill r T I f I f I I T I I f I r i I 1 t 1 ! 

ACCGCGTTCTACATCTTCACCTCGGGCACCACCGGATTTC 640 
CCAAGGCCAGTGTCATGACGCATCATCGGTGGCTGCGGGC 680 
GCTGGCCGTCTTCGGAGGGATGGGGCTGCGGCTGAAGGGT 720 
TCCGACACGCTCTACAGC TGCCTGCCGCTGTACCACAACA 760 
ACGCGTTAACGGTCGCGGTGTCGTCGGTGATCAATTCTGG 800 

810 820 830 840 

fill! r t i i t i t t t ! r * t 1 1 t t t t t i i f t 1 t i f f f 1 t f t I 

GGCGACCCTGGCGCTGGGTAAGTCGTTTTCGGCGTCGCGG 840 
TTCTGGGATGAGGTGATTGCCAACCGGGCGACGGCGTTCG 880 
TCTACATCGGCGAAATCTGCCGTTATCTGCTCAACCAGCC 920 
GGCCAAGCCGACCGACCGTGCCCACCAGGTGCGGGTGATC 960 
TGCGGTAACGGGCTGCGGCCGGAGATCTGGGATGAGTTCA 1000 

1010 1020 1030 1040 

■ ■ ■ ■ i ' ' ■ < I ' ■ • ■ 1 ■ ■ ' ■ 1 ' 1 ■ ' 1 1 1 1 1 I ' 1 1 ' 1 1 1 ' ■ I 

CCACCCGCTTCGGGGTCGCGCGGGTGTGCGAGTTCTACGC 1040 
CGCCAGCGAAGGCAACTCGGCCTTTATCAACATCTTCAAC 1080 
GTGCCCAGGACCGCCGGGGTATCGCCGATGCCGCTTGCCT 1 120 
TTGTGGAATACGACCTGGACACCGGCGATCCGCTGCGGGA 1 160 
TGCGAGCGGGCGAGTGCGTCGGGTACCCGACGGTGAACCC 1200 



fa. m 



mtFATP coding only.DNA 



1210 1220 1230 1240 

t f ' ' j 1 1 1 1 I L t l l \ t I I T 1 t f t t \ 1 I \ t 1 1 t T t t ( I I t 1 

GGCCTGTTGCTTAGCCGGGTCAACCGGCTGCAGCCGTTCG 1240 
ACGGCTACACCGACCCGGTTGCCAGCGAAAAGAAGTTGGT 1280 
GCGCAACGCTTTTCGAGATGGCGACTGTTGGTTCAACACC 1320 
GGTGACGTGATGAGCCCGCAGGGCATGGGCCATGCCGCCT 1360 
TCGTC.GATCGGCTGGGCGACACCTTCCGCTGGAAGGGCGA 1400 

' 1410 1420 1430 1440 

^ * > f 1 T ' * 1 ^ i i i t I ittil Ttttl t t i t 1 tiitl tittl 

GAATGTCGCCACCACTCAGGTCGAAGCGGCACTGGCCTCC 1440 
GACCAGACCGTCGAGGAGTGCACGGTCTACGGCGTCCAGA 1480 
TTCCGCGCACCGGCGGGCGCGCCGGAATGGCCGCGATCAC 1520 
ACTGCGCGCTGGCGCCGAATTCGACGGCCAGGCGCTGGCC 1560 
CGAACGGTTTACGGTCACTTGCCCGGCTATGCACTTCCGC 1600 

\ 1610 1620 1630 1640 

: W . i , i 1 i . i i I i i i i I . . i ■ I i i i . I i . ii I i i i i I i i ■ t I 

1 TCTTTGTTCGGGTAG.TGGGGTCGCTGGCGCACACCACGAC 1640 

< GTTCAAGAGTCGCAAGGTGGAGTTGCGCAACCAGGCCTAT 1680 

GGCGCCGACATCGAGGATCCGCTGTACGTACTGGCCGGCC 1720 
O CGGACGAAGGATATGTGCCGTACTACGCCGAATACCCTGA 1760 

i & GGAGGTTTCGCTCGGAAGGCGACCGCAGGGCTAG 1794 



fa*, m 



mtFATP coding only.protein 




MSDYYGGAHTTVRLI DLATRMPRVLADTPV I VRGAMTGLL 40 
ARPNSKASIGTVFQORAARYGDRVFLKFGDQQLTYRDANA 80 
TANRYAAVLAARGVGPGDVVG IMLRNSPSTVLAMLATVKC 120 
GAI AGMLNYHQRGEVLAHSLGLLOAKVL I AESDLVSAVAE 160 
CGASRGRVAGQVLTVEDVERFATTAPATNPASASAVQAKD 200 

210 220 230 240 



TAFY I FTSGTTGFPKASVMTHHRWLRALAVFGGMGLRLKG 240 
; SDTLYSCLPLYHNNALTVAVSSV I NSGATLALGKSFSASR 280 
i FWDEV I ANRATAFVY I GE ICRYLLNQPAKPTDRAHQVRV I 320 
I CGNGLRPEIWOEFTTRFGVARVCEFYAASEGNSAFIN IFN 360 
! VPRTAGVSPMPLAFVEYDLDTGQPLRDASGR VRRVPDGEP 400' 



GLLLSRVNRLQPFDGYTDPVASEKKLVRNAFRDGDCWFNT 440 
GDVMSPQGMGHAAFVORLGDTFRWKGENVATTQVEAALAS 480 
OQTVEECTVYG VQ I PRTGGRAGMAA I TLRAGAEFOGQALA 520 
RTVYGHLPGYALPLFVRVVGSLAHTTTFKSRKVELRNQAY 560 
GAD I EOPLYVLAGPDEGYVPYYAEYPEEVSLGRRPQG. 598 
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I I 1 I 



f I I I I 1 1 t I I 
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hsFATP3 

1 cga ccc acg cgt ccg ggg atg ttt gcg age ggc tgg aac cag acg gtg ccg ata gag gaa 
1 MFASGWNQTVPIEE 

61 gcg ggc tec atg get gec etc ctg ctg ctg ccc ctg ctg ctg ttg eta ccg ctg ctg ctg 

15 AGSMAALLLLPLLLLLPIiLL 

121 ctg ctg aag eta cac etc tgg ccg cag ttg cgc tgg ctt ccg gcg gac ttg gee ttt gcg 

35 LLKLHLWPQLRWLPADLAFA 

181 gtg cga get ctg tgc tgc aaa agg get ctt cga get cgc gee ctg gec gcg get gee gec 

55 VRALCCK RALRARALAAAAA 

241 gac ccg gaa ggt ccc gag ggg ggc tgc age ctg gec tgg cgc etc gcg gaa ctg gec cag 

75 DPEGPEGGCSLAWRLAELAQ 

301 cag cgc gec gcg cac acc ttt etc att cac ggc teg egg cgc ttt age tac tea gag gcg 

95 QRAAHTFLIHGSRRFSYSEA 

361 gag cgc gag agt aac agg get gca cgc gec ttc eta cgt gcg eta ggc tgg gac tgg gga 

115 ERE SNRAARAFLRALGWDWG 

421 ccc gac ggc ggc gac age ggc gag ggg age get gga gaa ggc gag egg gca gcg ccg gga 

135 PDGGDSGEGSAGEGERAAPG 

481 gec gga gat gca gcg gec gga age ggc gcg gag ttt gec gga ggg gac ggt gee gec aga 

155 AGDAAAG S GAEFAGG.DGAAR 

541 ggt gga gga gag ccc gec gec cct ctg tea cct $ga gca act gtg gcg ctg etc etc ccc 

175 GGGEPAAPLSPGATVALLLP 

601 get ggc cca gag ttt ctg tgg etc tgg ttc ggg ctg gec aag gee ggc ctg cgc act gec 

195 AGFEFLWLWFGLAKAGLRTA 

661 ttt gtg ccc acc gee ctg cgc egg ggc ccc ctg ctg cac tgc etc cgc age tgc ggc gcg 

215 FVPTALRRGPLLHCLRSCGA 

721 cgc gcg ctg gtg ctg gcg cca gag ttt ctg gag tec ctg gag ccg gac ctg ccc gec ctg 

235 RALVLAPEFLESLEPDLPAL 

761 aga gec atg ggg etc cac ctg tgg get gca ggc cca gga acc cac cct get gga att age 

255 RAMGLHLWAAGPGTHPAGIS 

841 gat ttg ctg get gaa gtg tec get gaa gtg gat ggg cca gtg cca gga tac etc tct tec 

275 DLLAEVSAEVDGPVPGYLSS 

901 ccc cag age ata aca gac acg tgc ctg tac ate ttc acc tct ggc acc acg ggc etc ccc 

295 PQSITDTCLYIFTSGTTGLP 

961 aag get get egg ate agt cat ctg aag ate ctg caa tgc cag ggc ttc tat cag ctg tgt 

315 KAARISHLKILQCQGFYQLC 

1021 ggt gtc cac cag gaa gat gtg ate tac etc gec etc cca etc tac cac atg tec ggt tec 

335 GVHQEDVIYLALPLYHMSGS 

1081 ctg ctg ggc ate gtg ggc tgc atg ggc att ggg gee aca gtg gtg ctg aaa tec aag ttc 

355 IiLGIVGCMGIGATVVLKSKF 

1141 teg get ggt cag ttc tgg gaa gat tgc cag cag cac agg gtg acg gtg ttc cag tac att 

375 SAGQFWEDCQQHRVTVFQYI 

12Q1 ggg gag ctg tgc cga tac ctt gtc aac cag ccc ccg age aag gca gaa cgt ggc cat aag 

395 GELCRYLVNQPPSKAERGHK 



ftaure WA 



1261 gtc egg ctg gca gtg ggc age ggg ctg cgc cca gat acc tgg gag cgt ttt gtg egg cgc 

415 VRLAVGSGLRPDTWERFVRR 

1321 ttc ggg ccc ctg cag gtg ctg gag aca tat gga ctg aca gag ggc aac gtg gec acc ate 

435 FGPLQVLETYGLTEGN^V^ATI 

1381 aa£ tac aca gga cag egg ggc get gtg ggg cgt get tec tgg ctt tac aag cat ate ttc 

455 N YTGQRGAVGRASWLYKHIF 

1441 ccc ttc tec ttg att cgc tat gat gtc acc aca gga gag cca att egg gac ccc cag ggg 

475 PFSLIRYDVTTGEPIRDPQG 

1501 cac tgt atg gee aca tct cca ggt gag cca ggg ctg ctg gtg gec ccg gta age cag cag 

495 HCMATSPGEPGLLVAPVSQQ 

1561 tec cca ttc ctg ggc tat get ggc ggg cca gag ctg gee cag ggg aag ttg eta aag gat 

515 SPFLGYAGGPELAQGKLLKD 

1621 gtc ttc egg cct ggg gat gtt ttc ttc aac act ggg gac ctg ctg gtc tgc gat gac caa 

535 VFRPGDVFFNTGDLLVCDDQ 

16 SI ggt ttt etc cgc ttc cat gat cgt act gga gac acc ttc agg tgg aag ggg gag aat gtg 

555 GFLRFHDRTGDTFRWKGENV 

1741 gec aca acc gag gtg gca gag gtc ttc gag gec eta gat ttt ctt cag gag gtg aac gtc 

575 ATTEVAEVFEALDFLQEVNV 

1801 tat gga gtc act gtg cca ggg cat gaa ggc agg get gga atg gca gee eta gtt ctg cgt 

595 YGVTVPGHEGRAGMAALVLR 

1861 ccc ccc cac get ttg' gac ctt atg cag etc tac acc cac gtg tct gag aac ttg cca cct 

615 PPHALDLMQLYTHVSENLPP 

1921 tat gee egg ccc cga ttc etc agg etc cag gag tct ttg gec acc aca gag acc ttc aaa 

635 YARPRFLRLQESLATTETFK 

1981 cag cag aaa gtt egg atg gca aat gag ggc ttc gac ccc age acc ctg tct gac cca ctg 

655 QQKVRMANEGFDPSTLSDPL 

2041 tac gtt ctg gac cag get gta ggt gec tac ctg ccc etc aca act gec egg tac age gec 

675 YVLDQAVGAYLPLTTARYSA 

2101 etc ctg gca gga aac ctt cga ate tga gaa ctt cca cac ctg agg cac ctg aga gag gaa 

695 LLAGNLRI* 

2161 etc tgt 
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